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ABSTRACT 

Included in this report on Arizona Indian demographic 
data are "an evaluation of several recent studies of Indian 
populations" and "an extensive analysis of methods for obtaining and 
maintaining accurate data in the future." Recommended methods by 
which accurate population data for the smaller reservations should be 
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FOREWORD 



Information is vital to successful program design and implementation. 
This study began under an assumption that reasonably reliable data on 
Arizona’s Indian populations could be readily obtained or developed. The 
research effort demonstrated that securing and maintaining accurate 
demographic data was both more difficult and more important, because 
of the wide discrepancies found in available estimates, than originally 
envisioned. Because of the magnitude and significance of the problem, 
this report has resulted in a program design rather than just a report on 
Indian populations. 

The Department of Economic Planning and Development is pleased 
to be able to publish this report and expresses its gratitude to the authors 
for the work done. Dr. Benjamin J. Taylor is Chairman of the. Economics 
Department and Dr. John Helmkamp is Chairman of the Accounting Depart- 
ment in the College of Business at Arizona State University. 

This report is directed toward those agencies of government with 
programs and responsibilities for aiding the Indian nations in Arizona. It 
is requested that these agencies examine this report and consider implemen- 
tation of a recommended program for maintaining current and reliable 
demographic data, 

Robert G. Worden 
Executive Director 
Arizona Department of 
Economic Planning and 
Development 
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CHAPTER 1 



INTRODUCTION 

The State of Arizona has one of the largest populations of Reser- 
vation Indians in any state in the nation. The 197 0 Census of the Popula- 
tion reports 95, 81Z Indians residing in Arizona, At April 1, I960, the 
Census report 83, 387 Indians in Arizona. The great majority of these 
Indians live on reservations, at least most of the time. 

The 95, 81Z Indians residing in Arizona at April 1, 1970, repres- 

ented 5. 4 percent of the state *s total population of 1, 77Z, 489. This 
ratio is reduced from the 6. 4 percent of the total Arizona population 
represented by Indians in I960, but remains a significant portion of the 
jjopulat n. More importantly, Indian residents of Arizona form a 
much larger share of the state's low income residents. Indians suffer 
disproportionately from unemployment and underemployment, poor 
housing, problems in the delivery of health services and educational 
oppo rtunitie s . 

The many problems faced by Arizona's Indian populations are 
recognized, but a lack of accurate demographic data frequently thwarts 
efforts of state and federal agencies to help solve them, and it obscures 
the actual magnitude of various opportunities as well as deficiencies. 
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The State of Arizona is interested in aiding the Indian Tribes in 
their development efforts. However, the ability of the state to assist 
in plans and programs for improving the economic and social environ- 
ment of the reservations is heavily dependent upon accurate data. The 
magnitude of the information problem can be illustrated by comparing 
recent estimates with U. S. Census figures. 

C urrent Data Problems 

In a recently completed working paper for the Department of 
Economic Planning and Development, Gonzales and Johnson made an 
analysis of Reservation population "census” figures being used at the 

U 

time by the Bureau of Indian Affairs. The Tribal Census total popu- 
lation figures being used were 107, 849 in June of 1969 and 104, 018 in 
March ^ 970. Other sources cited on page 1 1 of that paper indicated 
1970 estimates of 102, 000 to 104, 000. When these estimates are 
compared with the U. S. Census count of 95, 812, it is obvious better data 

are needed^ particularly when it is remembered that the U. S. Census 
count includes all Indians in. the state, not just those on the reservations. 

Apache County in Arizona has one of the larger Indian populations, 
and an examination of the problem of maintaining accurate estimates in 
this county is revealing. The U. S, Census reported 32, 298 residents 
in A^pache County at April 1, 1970, and of this total, 23, 994 or 74. 3 
percent, were Indians. In I960, the Census count for the county was 

1/ Val Gonzales and Eugene Johnson, Integration of the Indian 
Reservations with A rizonans Economic Planning and Development Process , 
Arizona Department of Economic Planning and Development, (Phoenix^ 1 970) , 

pp. 8-10. 



30,438 residents, of which 22,814, or 75 percent, were Indian. The 
Unemployment Compensation Division of the Arizona Employment 
Security Commission uses accepted Bureau of the Census methods 
in preparing population estimates for Arizona's counties. For July 1, 
1969, it estimated the population of Apache County as 46, 500, substan- 
tially above the 1970 Census count of 32, 298. 

Information Needs 

The reason for the apparent over estimate of tne population of 
Apache County reflected above is not difficult to explain. Records of 
vital statistics show the ten-year natural increase (number of births 
over number of deaths) in the County's population was 11, 897. Also, 
of the 14, 644 births in the County betv^een I960 and 1970, 11, 487 or 
7o 4 percent were Indians. Therefore, if the U. S. Census counts are 
accurate, there was a substantial migration of Indians from the county. 

An analysis of Census counts and vital statistics data indicates a total 
out-migration from the county during the decade of 10, 037 persons. 

Without a method for maintaining information on these population 
movements, it is difficult to assist in the development of the reservations. 
Prospective industries require reliable labor force data before investing 
in plants and equipment. These data are also needed for establishing 
priorities for public investments in programs and facilities to alleviate 
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the problems of manpower training, housing, health and welfare which 
beset the Indian peoples. 

Objective s 

This report includes, first, an evaluation of several " ecent 
studies of Indian populations and, secondly, an extensive analysis of 
methods for obtaining and maintaining accurate data in the future. 
Recommendations of methods by which accurate population data for the 
smaller reservations should be maintained are included in the first 
portion of the report. Various sampling techniques for use on the 
larger reservations are evaluated and methods are expla.ined in detail 
in the second portion of this paper „ It is hoped that resources will be 
available to maintain a.ccurate data on the Indian populations to enable 
state and federal agencies to assist in solving ^ome of the pressing 
problems faced by reservation Indians. 
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CHAPTER Z 



OVERALL EVALUATION OF INOIAN 
DEMOGRAPHIC DATA FOR ARIZONA 

Indian demographic studies are available, but in limited quantity 
and quality. !Many studies have been conducted on various reservations, 
but most of them are not generally accessible for review or use. It is 
clear from the studies, such as are available, that there are several 
outstanding weaknes...es in all of the attempts to develop reliable data. 
One serious problem is the poor quality of reservation population rolls 
from which reliable samples may be drawn. Not a single reservation 
in the State of Arizona maintains a well-edited population roll which is 
“to — da,.e at all tim.es. The fact that Indians are far more mobile than 
is usually realized makes such a task one of monumental proportions. 
This very basic deficiency makes it extremely difficult to develop data 
over any prolonged period of time. It should be realized, however, 
that maintenance of up-dated population rolls on all reservations at 
ail times would be expensive and may be unnecessary. 

Previous Studies 

Ivlost studies are usually one-shot efforts that more likely than 
not are only reasonably characteristic of the on-reservation population 
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during the finite period in which the data were collected. There is no 
valid reason, for exs-inple, to assume that the data collected in June 
are characteristic of the on- re se rvation population in any other month 
of the year. This problem exists because approximately one -half of all 
Indians reside off their reservations at any given period of time. There 
is evidence that this one -half of the total population changes substantially 
from one period of time to another. Thus, longitudinal studies of the 
demographic characteristics of Indians are desirable if more adequate 
data are to be developed which will generate more conclusive findings 
regarding Indian manpower characteristics. 

Efforts to use various censuses of several reservations have 
been made; however, such efforts have proven less reliable than sanaples 
because of the vast mobility of Indians. A census does not permit cal- 
culation of error whereas sample errors are calculable. Again, a 
census at one period of time on a particular reservation, assuming all 
on-reservation Indians are interviewed, could result in substantially 
different findings from one census to the next becauje of Indian migration. 
Relatively inexpensive sampling methods should be utilized to obtain the 
basic data needed about Indians which would, at the same time, not make 
subsequent samples prohibitive in the event that a check is desired on 



the initial sample. 



The single most pressing problem that should be resolved before 
any serious attention can be devoted to developing and maintaining 
current data on reservation Indians is maintenance of carefully edited 
population rolls on a selective basis. The tribes themselves are in the 
best position to accomplish this task, A clerk will probably have to be 
maintained on the larger reservations who can devote a proportion of 
her total work time to this function. On small reservations, less time 
would be required of an employee to fulfill this function. The Department 
of Economic Planning and Development may find it necessary to assist 
tribes or some group, such as Indian Development Districts of Arizona, 
to obtain funds for the purpose. Continuously well-maintained population 
rolls would permit development of demographic data at minimal cost. 

Calculated Cost of Mainta ining Rolls 

Only eight of the twenty reservations listed on Table 1 would 
require almost continuous attention to maintain reasonably correct 
population rolls. All of the others should be developed initially and 
then edited only once per year, unless more frequent editing is required 
for sampling. It is assumed that continuous effort would be required 
to maintain rolls on reservations with populations of 1, 500 or more. 

After the initial rolls are developed for reservations with less than 
1, 500 population, no more than $500 should be required to bring them 
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Table 1 



Comparison of Population Figures Reported 
By Four Principal Agencies 





IDDA® 


ACIA° 


W 
1— 1 

> 

1 — ' 

o 


1 1 

IHS 


T ribe 


Mar 69 


May 7 0 


Mar 70 


Jul 7 0 


Ak Chin 


220 


240 


240 


400 


Camp Verde 


607 


314 


314 


686 


Cocopah 


99 


63 


64 


- 


Colorado River 


1, 620 


1, 297 


1, 297 


1, 430 


Fort Apache 


5, 712 


5, 953 


5, 953 


5, 800 


Fort McDowell 


261 


280 


280 


330 


Gila River 


6, 140 


5, 241 


5, 241 


6, 450 


Gila Bend 


103 


- 


44 


262 


Havasupai 


365 


270 




- 


Hopi 


4, 966 


6, 000 


o o 
/ 


7, 250 


Hualapa 


682 


1, 028 


68. 


1, 520 


Kaib a,b - . aiute 


140 


60 


9 


- 


Navajo 


69, 631 


71, 396 


71 - . 


- 


Papago 


4, 544 


5, 506 


4, 6S8 


7, 900 


Salt River 


2, 212 


2, 040 


2, 0^0 


2, 265 


San Carlos 


4, 583 


4, 404 


4, 404 


5, 050 


Yavapai -Prescott 85 


90 


90 


90 


San Xavier 


558 


- 


574 


~ 


Totals ^ ^ 


102, 528 


104, 182^= 


102, 803 


39, 833'i 



^Total not complete - -several figures not available. 

^Total not complete - -seve ral figures not available. 

®Four Corners Regional Commission Technical Assistance Project, 
Arizona, Pr .liminary State Development Plan, Arizona Department 
of Economic Planning and Development, March 1969, Phoenix. 

^Arizona Commission of Indian Affairs, Tribal Directory, 1970, a 
pamphlet. 

^^Bureau of Indian Affairs, Reservation Programs, March 1970, op. cit. 

^ ^Indian Health Service, Sacaton Service Unit, Program Plan, Phoenix 
Area and Personal Interview with Phil Pepper and Charles Green. 

1 Z 

Although population figures are not available for all the tribes in the 
Arizona Commission for Indian Affairs and the Indian Health Service 
columns, the discrepancies rer,..ected in each agency's figures for the 
respective tribe are apparently obvious. 



S ource: Taken from Val Gonzales and Eugene Johnson, "Integration of the 

Ihclian Reservations with Arizona's Economic Planning and Development". 



Y Mimeographed Paper, August 1970. 
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would be involved for 



completely up-to-date. Very little expense 
reservations such as Ak Chin, Cocopah and Yavapai-Pre s cott. Know- 
ledgeable on-rese rvation informants could edit small lists in about one 
day. It is estimated that approximately $7, 500 would be required to 
obtain initial working lists for all the Arizona reservation^ exclusive 
of the Navajo. This figure does not include the full time cc 't ot me staff 

person's salary for approximately three months. 

Editing difficulties will accelerate with reservation s Eve 

knowledgeable informants will be unable to maintain a perfectly ccur te 
list. Keeping up with births and deaths is not an impossible ;a- , bur 

there is a severe limitation in attempting to learn which Indian: are 

on or off the reservation at any moment of time. Reservations are 
usually composed of vast acreages which make it impossible for even 
the more informed Indians to know of all of the migration. But consistent 
effort can minimize the number of mistakes. This observation suggests 
that editing should take place at the time a survey is to be conducted 
and that continuous editing would be unnecessary. This is particularly 
the situation if primary emphasis is placed on the age 16 and over 
population as opposed to the total population of each reservation. 

There is still another very important problem associated with 
attempts to collect Indian reservation dem.ographic data. Indian tribes 
are very reluctant to cooperate with researchers. Usually, data are 
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collected within two to three weeks after all other arrangements have 
been made, but the preliminary arrangements may take as long as six 
months. Tribal councils require long periods of time prior to deciding 
to cooperate with some group that wishes to collect data. The Indian 
is suspicious of the objectives of most groups that want to survey their 
reservations. Any agency or firm that requires elementary information 
concerning Indians should be aware of the public relations time that may 
be required to obtain tribal cooperation. For this reason, it is highly 
probable that all Indian tribes will not support every request to run a 
sample. This problem may make it virtually impossible to conduct 
longitudinal studies to obtain more accurate demographic data. A 
single study of any given Indian reservation may be the very best that 
may be expected. It is suggested, therefore, that once permission is 
granted to sample a given reservation, an attempt should be made to 
obtain information on as many aspects of Indian life that may possibly 
be needed for planning. The incremental cost of doing so would also be 
very small compared to the expense associated with a nev^ sample each 
time information is needed on any given subject. 

evaluation of three reservation population studies 

Indian Manpower Resources in the Southwest: A Pilot Study 

Sampling Plan . The overall strategy of the sample involved 
several steps. First, a questionnaire was developed and tested. 

Benjamin J. Taylor and Dennis J. O'Connor, Indian Manpower 
R esources in the Southwest ; A Pilot Study, Arizona State University, 
(Tempe, Arizona, 1969 ), 



The questions generall-^^ concentrated on individual experiences that 
would permit: extensive description of the nature of Indian manpower 
utilization. Second, population lists were developed for each reserva- 
tion for the purpose of sample selection. Finally, interviewers from 
each reservation were selected and “rained to deal with the perceivable 
problems that might arise in the conduct of interviewing. Continuous 
direction was provided the Indian interviewers during the interview 
period. 



Population Lists : The population lists on the different reserva- 

tions came from different sources and presented different problems, 

The most reliable tribal rolls for the two reservations in New Mexico 
were maintained by the Bureau of Indian Affairs. Thei:>e tribal rolls 
provided the primary sampling frame for the Pueblo Laguna and the 
Pueblo of Acoma, The rolls were first edited by tribal officials and 
then later by different members of the tribes. Persons not known to 
the individuals editing the population lists were included in the sampling 
frame Sp 

The inclusion of unknowns in the sampling frames did not intro- 
duce a sampling bias since if they were not on the reservation, even 
though their name appeared in the sample, the non-response was from 
a person not in the population of interest. The standard error did tend 
to rise because of this feature, but remained within manageable limits, 

O 
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The e procedure w. followed on all five reservations included in 

the study. The basic problem of this approach is that there is no 
assurance that the population remains the same^ even for periods as 
short as a week or a month. Demographic characteristics, therefor , 
are subject to violent change. 

Sample Size. The sample size for each reservation is related 
to population characteristics, such as proportions and means, that 
were to be developed in the survey. Since the questionnaire focuses 
primarily on frequency of response to certain questions, the sample 
size was determined with reference to the standard error of the propor- 
tion. 

The specification adopted for the planned error is that the standard 
error for a ZO percent characteristic not exceed 1. 5 percentage points. 
This means that a sample estimate of ZO percent should be capable of 
establishing a 95 percent confidence limit of ZO + 1. 96 (1. 5), or approx- 
imately 17. 0 - Z3. 0 percent, for the true value for the entire reservation. 
This is a relative error of I. 5/ZO or 7. 5 percent. The relative errors 
for characteristics exceeding ZO percent will then be less than 7. 5 per- 
cent. For characteristics occurring with less than ZO percent frequency, 
the expected sampling errors will exceed the 7. 5 percent figure. 

Given the restriction that the relative error for 20 percent 
characteristics not exceed 7. 5 percent, random sampling theory is used 



IZ 

16 



to find the requisite sample size. Taking account of the finite sizes of 
the populations sampled, the error formula is 

= T N - n 
np N 



Where V is the error limitation of . 075, i. e. , 7. 5 percent 
p is . 20, i. e. , 20 percent 
q is 1-. 20, i. e. , 80 percent 
N is the size of tha population 
n is the sample size 



Restated to solve for sample size, this formula becomes 



Nq 

n = ^ 

V^Np + q 



. SON 

(ToTsT^TTToy^rTT^ 



For some illustrative population sizes, the required sample 
sizes are shown below: 



Population F rame 



Sample Size 



1000 420 
2000 530 
3000 580 
4000 610 
5000 630 
Infinite 710 



These sample sizes will yield estimates for the mean of a 
normally distributed variable that have high assurance (95 percent) of 
being correct within 2. 6 percent; 95 percent confidence limits can be 
established for the mean of a highly skewed variable within 5. 4 percent. 
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The relative errors for percentage characteristics or for totals based 
upon them will depend on the sample percentage obtained. These 
sample sizes will provide estimates within 7. 6 percent of the true 
value (with 95 percent confidence) for 50 percent characteristics, within 

15 percent of the true value for 20 percent characteristics (the original 
error specification), and within 22 percent of the true value for 10 
percent characteristics. 

Table II shows both the sample sizes that were planned and those 
that actually resulted. Those planned for each reservation were for the 
purpose of keeping the standard error of the proportion at the level of 
. 015. The planned sample sizes ranged from 500 for the Acoma Reser- 
vation, which was the smallest community in the study, to 580 for the 
Papago Reservation, which was the largest reservation in terms of the 

16 years of age and over on-reservation population. Despite the fact 
that there were more than twice as many names on the Papago popula- 
tion list as compared to the Laguna population list, the difference in the 
planned sample sizes was only 80. 

Table 2 . 

Planned and Actual Sample Sizes 





Planned 


Actual 


Laguna 


527 


261 


Acoma 


500 


240 


Pagago 


580 


382 


Fort Apache 


550 


554 


San Carlos 


524 


350 
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The actual size of the sample for each reservation was affected 
by non-responses on each reservation. The fact that some of the sampling 
frames used included many people who no longer resided on the reserva- 
tion resulted in reduced sample sizes. On the Laguna Reservation, for 
example, 527 names were selected from the modified population list 
provided by the Bureau of Indian Affairs. Approximately one -half of 
those selected were actually on the reservation and as a result the sample 
size turned out to be 261. 

On the Fort Apache Reservation, a somewhat different procedure 
was followed. Since it was surveyed after both the Laguna and the Acoma 
Reservations, experience made it seem obvious that the actual sample 
size would be smaller than the planned sample size unless the sample 
was inflated. A pretest of the population list was run to determine its 
accuracy. A sample of 686 was drawn in the hopes of getting 550 
responses. This technique worked well since 554 usable responses 
were obtained from the effort. This is the only reservation for which 
the standard errors are equal to the desired levels. In all other cases, 
the standard errors of the proportions are larger than intended. These 
standard errors are not so large as to make interpretation of the statis- 
tical results difficult. The standard errors of the proportions for a 
20 percent characteristic, given the actual sample sizes used on each 
reservation, are reported in Table 3. The preceding discussion 

15 
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Table 3 



Actual Standard Errors for a 20 Percent Proportion 

By Reservation 
(Planned error was . 015) 



Laguna 

Acoma 

Papago 



. 021 
. 022 
. 019 
. 015 
. 020 



Fort Apache 
San Carlos 



applies to sampling errors associated with estimates of the population 
as a whole. For purposes of analysis, various subgroups of the popula- 
tion were examined. Some of these subgroups are relatively small and 
the sampling errors associated with these subgroups are larger than 
those of the general population.. Information for some of the small 
subgroups is presented even though sampling errors are large. This is 
done because some of the information is very suggestive and because 
little other information on the particular subject is available. The reader 
is, however, advised to interpret subgroup data with caution. Differences 
in subgroup data must be large if the differences are to be statistically 
significant. 

Procedu re for Drawing Sampling Units 

The procedure for selecting sampling units was structured so 
that the advantages of geographic stratification would be present. The 
sampling lists were organized so that all people in a given community 
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were grouped together. The sampling list for each reservation may 
be viewed as a series of lists representing the smaller communities 
on the reservation. Each of the smaller communities represents a 
strata within the list. The number of strata for each reservation is 
reported in Table 4. 



Table 4 

Number of Communities on Each Reservation 



Laguna 


7 


Acoma 


2 


Papago 


54 


Fort ^ pache 


10 


San Carlos 


11 



The communities vary widely in size. In some cases, the popula- 
tion of a community xiiay be less than Z5 persons. In other cases, the 
population for a community may be 1, 000 or more. 

The sampling takes advantage of any tendency for the character- 
istics under investigation to vary systematically by community. These 
gains in accuracy from effective stratification by community tend to 
reduce the sampling error below that of an unrestricted random sample 
of comparable size. On the other hand, inability to completely enumerate 
names selected in the sample and errors remaining in the edited reser- 
vation lists tend to increase sampling error. 



Interpenetrating replicate subsamples were drawn from the 



stratified sampling list. That is, instead of drawing one large sample, 
a number of smaller samples were drawn. The number of subsamples 
was set at ten. For example, if a sample of 400 was desired, 10 sub- 
samples of 40 were drawn- The number of subsamples selected is large 
enough to make possible calculation efficiencies, but it is not too large 
as to interfere with the stratification of the sampling list- 

The first step in the selection of the sampling units was to divide 
the sampling list into equal zones. The number of zones in the sampling 
list is equal to the number of elements in a subsample. Each and every 
zone provides one observation for each of the ten subsamples. Thus, if 
the sampling list contained 4, 000 names and a sa^mple of 400 is desired, 
then the sampling list would be divided into 40 zones, each with 100 names. 
The selection of the elements to be included in each subsample 
was made on a systematic basis. The first unit of each subsample was 
selected at random from the first zone of the sampling list. The sub- 
sequent units included in each subsample were selected by adding a 
number equal to the zoning interval to the random start. An example of 
a sample selection table is illustrated in Table 5, This table assumes 
that there is a list of 4, 000 names from which 10 subsamples of 40 each 
will be drawn. This means that there are 40 zones with 100 names in 
each zone. Selection of the starting numbers is without replacement. 




18 




Table 5 



An Example of a Sample Selection Table 



Subsample 



Zone 


1 


2 


3 


4 


#0001 - 0100 


55 


27 


68 


etc. 


#0101 - 0200 


155 


127 


168 


etc. 


#0201 - 0300 


255 


227 


268 


etc. 


#3901 - 4000 


3955 


3927 


3968 


etc. 



Strength of the Method 

There are several strengths of the method as designed. It achieves 
the advantages of geographic stratification. The larger comnmunities on 
each reservation are weighted more heavily in terms of sampling units 
selected than the smaller communities. In addition, by concentrating 
the interviewers in different communities, any interviewer bias that 
might be pi'esent is spread over a number of subsamples. The method 
is also convenient for making the calculation of standard errors routine 
and simple. Since there are hundreds of estimates of proportions for 
each reservation, this last advantage is of major significance. The 
standard error of any proportion can be estimated through the examina- 
tion of subsample proportions. The standard error can be calculated 
by subtracting the lowest subsample proportion, the highest subsample 
proportion and dividing by the number of subsamples. The estimate of 
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p 



P max - P min 

To 



the total proportion is computed through the combination of all ten sub- 
sample frequencies. If the population roll is highly accurate, reasonable 
estimates may be obtained because the response to various questions 
will be larger than if the roll from which the sample is drawn does not 
reflect high accuracy. 

The book Indian Manpower Resources in the Southwest: A P ilot 
Study contains detailed characteristics of reservation populations, details 
about the employed and unemployed, extent of education and training, 
consumption and expenditure pcitterns, skill and industry experiences, 
and sources of income. 

Specific categories include: 

1. Population by age group, six, and education attainments. 

2. Occupation and industry experiences of the working age 
populations over a five-year period. 

3. Labor force participation rates by sex, age, and educational 
level. 

4. Under utilization of the manpower resource. 

5. Labor force status by sex, age and education. 

6. Source of training for usual job. 

7. Place (on or off reservation) where work is performed. 

8. Ability to communicate in the English language. 



9. Disabilities preventing individuals from working. 

10. Source and amount of income received by individuals and 
families during the previous year. 

11. Consumption and expenditure patterns. 

Data collected are useful for a variety of purposes. They will 
give, if data are current, prospective employers insights into the skill 
availability of reservation Indians. Also, they will assist employers 
and relevant government agencies in determining which skills training 
programs are more likely to be successful. 

The data will not provide prospective employers with ir;^ rmation 
regarding the skills possessed by off reservation Indians who v c. -d 
pro oably return to their reservations if job opportunities shoul evelop. 
Thus, the book should be taken in certain cases as revealing minimal 
characteristics of reservation Indian potential for development. 

INTEGRATION OF THE INDIAN RESERVATIONS 
WITH ARIZONA'S ECONOMIC 
PLANNING AND DEVELOPMENT PROCESS 

This is a working paper prepared for the Department of Economic 
Planning and Development by Val Gonzales and Eugene Johnson. It is 
comprised primarily of an attempt to find and list existing documents 
that deal with the demographic features of two reservations - The 
Papago and The Gila River. Other reservations w’ere included, but the 
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major effort was devoted to the two reservations mentioned. Much of 
the effort was devoted to explaining to tribal leaders the need and 
importance of economic planning and development of reservation 
economies. 

The report provides population estimates of the various Arizona 
reservations made by four groups. The discrepancies observed in the 
four estimates reflect not only the differences in procedures used, 

but also the different times of the year that each group made the estimate. 
Neverthe ess, the die 'repancies reveal the problem facing researchers- 
V O attempt to establish sampling devices to obtain valid demographic 
data. 

Much of the report is an account of the difficulties that Gonzales 
and Johnson encountered in their attempts to obtain the cooperation 
of tribal governments to develop accurate tribal rolls. The basic reason 
that widespread discrepancies exist is that the tribes do not maintain 
adequate rolls themselves and display an unwillingness to cooperate 
with groups that would welcome the opportunity to do so. 

The report is primarily an ^4n~house" working paper with 
limited value to others. The document's greatest value is the documen- 
tation of the extreme difficulty that researchers can expect to encounter 
when they try to obtain Indian cooperation in almost any project imagin- 
able. Even Indians who seek tribal assistance find they have considerable 
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difficulty with the various tribes. Not ail tribes, however, xhibit 
the same degree of difficulty as can be expected from the Pa.pago. The 
Papago Reservation has had a survey run almost continuously during 
recent years because of the activities of the Indian Health Service 
ar.d the Unive.'sity of Arizona. Other tribes have had little cf such 
activity and can be expected to be mo a cooperative. 

NAVAJO MANPOWER SURVEY 

This report was a result of v\ rk by the Navajo Ma:mower Survey 
Task Force soonsored by Navajo Tr: .e, Indian Health Ser -ice. Bureau 
of Indian Affairs, Arizona State Emnloyment Service, and Office of 

3 / 

Navajo Economic Opportunity. 

The Navajo Task Force obtained a population register from the 
Bureau of Indian Affairs, Navajo Area Office. The population register 
is maintained by the 72 B. I. A. school districts. The school districts 
were combined and 18 universes were formed to establish districts of 
at least 3, 000 persons each (14 years of age and older). Except in the 
more industrialized sections, at least a 10 percent sample was drawn 
from the districts. 

There was no discussion of difficulties with the drawn sample. 
That is, no indication was given regarding the quality of the B. I. A. 
maintained population rolls. School district rolls may not focus on 
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3/ Bernd Schwartz, Navajo Manpower Survey , Arizona State 
anpLovment Service (Phoenix, Arizona, January 1970). 



27 



23 



families without school age children. Many questions come to mind 
rega 'ding the qi alit‘ of the populatio;; lists which could lead to sub- 
stantial bias in the data. 

It was indicated that more than a 10 percent sample was drawn 
in the more industrialized sections of the reservation. No mention was 
made of adjusr nents to insure that s ich a practice did not bias the 
overall characteristics of the Navajc population. Considerably more 
information ccncerning the method used to obtain data is necessary 
before the study can be adequately judged. 

The Navajo Manpower Survey provides a brief description of the 
evolvement of the socio-economic situation on the Reservation. It also 
speculates regarding the political structure of the Reservation which has 
generated considerable unity of the various factions with regard to 
utilization of their resources. Substantial emphasis is placed on the 
Navajo desire to attract industry to the reservation as opposed to train- 
ing of individuals to go to distant areas to compete effectively for jobs 
with non -Indians. In this regard, the Navajo Manpower Survey proceeds 
with the same premise as other studies that focus on Indian manpower. 
The Indian is not different from other groups in the population in that 
all prefer to remain in more familiar surroundings. Once the basic 
premise is stated, the study proceeds to describe the characteristics 
of the Navajo population, aged 14 and over. 
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Characteristics of the ma:_power resource focused primarily 
on the non-stud ‘r:.t population bee. use of the difficulty found in attempts 
to contact stude :s. The failure j >-icluce students in the study could 
result in a subs: _ntial understates mt of the population that might prefer 
employment, If such were available. That is, many students were 
enrolled in off-rese rvation boarding schools who may otherwise have 
remained on the reservation t work and forego schooling, they were 
given that choice. However, t-iis situation has been communicated to 
the reader and does not seriously distort the findings because the 
student population is treated separately. 

The study is similar to Inaian Manpower Resources in the Southwest 
in that it contains data on the population by age grouping, educational 
attainment, use of the English language, and the desire for training. 

The Navajo study also treats the characteristics of the labor 
force in a manner that is not consistent with either definition used by 
the United States Department of Labor or the Indian Manpower Resource 
Study, The Navajo study included a person in the labor force if he indica- 
ted he desired to work. This definition overstates the size of the labor 
force relative to the official definition of that concept. However, this 
is a debilitating problem only if there is a desire to compare the Navajo 
to the United States or some subgroup of the country. If the data are 
to be used only to illustrate vhat one might reasonably expect to find 



the definition is acceptable. Navajo 



on the Navajo Reservation, 

Manpower Survey recogn_z :s problem. 

Labor force particii;. .,n by age and sex is illustrated in the 
study. It also provides in. n-;ion on educational attainment of the 

labor force and calculates ,.nn schooling which are useful measures 

for prospective employers r: ng location sites. Navajo occupational 

experience is included in th ; dy. Such data are highly useful because 
they indicate the degree of ...n experience with various categories of 
labor force discipline. Stab — *:y of the Indian in an employment relation- 
ship can be inferred from the study. Also, the size of the "available 
pool" of workers is indicated the data. This can be inferred from the 
responses of those participating in the survey, but, of course, such 
responses should be approached with caution. Those who stated they 
desired employment may, in fnzt, be unemployable in most occupational 
categories, if not all of them. Other problems are associated with 
such responses. 

Characteristics of tine employed and unemployed Navajos are 
also provided in convenient manner. These data are broken into the 
categories of usual occupation, educational attainment, and age and sex. 

In brief, the Navajo Manpower Survey provides a reasonable 
illustration of the reservat’ n Indian* s labor force characteristics at 
the ^ t^ie survey was made . There were no income and consumption 
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data, such as were included in the India n Manpower Resources Study, 



but the Navajo study had an objective that was somewhat different from 
it. The Navajo survey is an important document that will serve as a 
primary source on labor force data until adequate funds become avail- 
able to replace it. The major problem with the study is the horrendous 
problems associated with the sample technique. The data should be 
used with caution. 

Bureau of Indian Affairs and Indian Health Service Studies, 
various Bureau of Indian Affairs offices and Indian Health Service units 
conduct studies that deal with demographic features of reservation 
Indians. Most of these tend to be informal working papers with a limited 
purpose. Some eventually find their way into print, but most become 
lost in storage. The sampling methods used vary drastically from 
highly scientific to the naive. Usually, it is almost impossible to obtain 
information about the method used because of personnel turnover in 
those agencies, and, thus, most of these documents must be ignored 
even if they could be uncovered. 

The Indian Health Service servicing the Papago Reservation is 
most active in assembling data for the Papago Indian and should be 
utilized prior to any surveying anticipated for that tribe. Much of the 
information needed may be in their possession. The Bureau of Ethnic 
Studies at the University of Arizona has also assembled data on the 
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Papago Indian and is also a valuable source of certain types of demo- 
graphic data which may preclude the necessity of having to sample 
for certain information. 

The other reservations have not had the same magnitude of 
activity, and, therefore, greater effort will be associated with obtain- 
ing basic demographic data. It should be noted that the Papago and 
Navajo reservations pose the greatest sampling problems. All others 
are relatively simpler. This is due partly to the relatively larger 
geographic size of these two reservations and partly to the greater 
dispersion of their on-reservation populations. 
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CHAPTER 3 



GENERAL SAMPLING STRATEGY 
The objective of the sample design stage of ihe Indian study is 
to insure that the relevant statistical measures listed on the questionnaire 
and discussed earlier are estimated on a probilistic basis that insures 
objective and unbiased results. A wide selection of sample survey 
designs are available for collecting the important statistics concerning 
Indian life. Four specific plans are recommended in this section to 
achieve the survey objectives described in other parts of this study. 

The methods are: (1) simple random sampling; (Z) stratified simple 

random sampling; (3) cluster sampling; and (4) systematic sampling. 

Each plan has certain advantages and disadvantages which must be 
evaluated carefully to insure that an efficient sample design is achieved. 
An efficient sample plan is one which achieves a high degree of precision 
per unit of cost. Precision in this sense represents a measure of how 
close an estimate is to the true value being estimated. The two major 
dimensions which must be evaluated for this objective are: (1) The 

amount of variation and skewness in the frequency of occurrence of the 
statistical measures of interest; (2) The amount of geographical dis- 
persion on a given Indian reservation population. Since the entire 
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populs-tion is not 6niiin6 ra.t6d, tli6 S3.mpl6r must consid6r th© risk of 3. 
sampling error as measured by the standard deviation of the sample 
mean (hereafter referred to as the standard error). This error arises 
because we cannot expect the estimated results to be exactly the same 
as the true values involved so the concern is to control any differences 
within probilistic limits. 

Since the precision of the results obtained from a sample survey 
depends primarily on the sample size and method of sample selection, 
different efficiency levels can be achieved by choosing among the sampling 
plans described in this section. Consequently, the sample size required 
to attain a desired precision-cost objective can be altered through the 
choice of the sampling method. When future surveys of Indian popula- 
tions are made, the sampler should consider the prevailing conditions 
of a particular project (such as the resources available, objectives of 
the sample, precision demands, geographical dispersion on the reser- 
vation, and the amount of variation between the statistical lives of 
different Indians) and match them -with the characteristics discussed 
in this section to attain a satisfactory efficiency level. As a result, 
different plan.s may be chosen at different times to control the standard 
error within acceptable boundaries at a reasonable cost, thus achieving 
varying precision per unit of cost outcomes. Once a given plan is 
selected as being the most appropriate for a specific reservation, 
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it normally would be used consistently over time to insure comparable 
results unless significant changes occur. 

A complete sample program is presented in this section lor each 
of the plans to provide a procedural description with which future samples 
of Indian data can be drawn. The population elements are defined as all 
Indians who are sixteen years of age or older and the statistics of inter- 
est for this study are those listed on the questionnaire shown in Appendix 
A. 

Many of the statistics listed on the questionnaire are either 
estimates of attributes (frequency of occurrence of some characteristic 
such as a given income level) or estimates of variables measurements 
(average or total values of measures of interest, such as the average 
individual Indian income for a given reservation). Since we did not 
have an actual situation to work with in this study, certain assumptions 
were made to provide an illustrative base for describing the various 
sampling techniques. In a real life application of the sample plans, 
the questionnaire shewn in Appendix A will serve as the instrument 
with which the vital Indian statistics will be retrieved. However, in 
the description of the different plans in this section, the focus is on 
income data exclusively and the hypothetical individual Indian income 
population shown in Appendix C is used for illustrative purposes. Since 
most of the statistical measures of interest on the questionnaire shown 
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in Appendix A are either directly or indirectly related to Indian income, 
the income focus provides an effective base for administering the survey 
anyway. This point will be discussed more completely later. The popu- 
lation of 1, 000 numbers representing the income levels of the Indians 
of a hypothetical reservation provides the basis for demonstrating the 
sample selection process with the various plans. In actual applications, 
the same analysis would be extended to the other measures recorded on 
the questionnaire. The income values were simulated from the results 
of an earlier study by Taylor and O'Connell, Indian Manpower Resources 
In th e Southwest . 

We assume that sampling without replacement is used throughout 
the analysis. Therefore, once a given Indian has been selected, he 
cannot bo chosen again for inclusion in the sample. We a,lso assume 
that appropriate record keeping of the population rolls will be achieved 
for each Indian reservation of interest (see an earlier section for a 
related discussion). Consequently, movements to and from the reser- 
vation will be recorded continuously and the population roll will be 
accurate any time a sample is drawn. Obviously, this factor is extremely 
important since the sample results of any survey cannot be any more 
representative of the true population measures of interest than the data 
recorded on the list from which the sample is drawn. Thus, the only 
type of error considered in this study is the sampling error. We 
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recommend strongly that an adequate investment in a record keeping 

system for the maintenance of reliable population rolls made and 

that periodic checks be performed to confirm that th 2 records are 

correct. If it is discovered that a significant amount of j ovemeiit on 

and off the reservation occurs but cannot be effectively recorded, an 

attempt should be made to determine the impact of this movemezit on 

the estimated statistics of interest. One suggestion to accomplish this 

objective would be to estimate several mean values of measures of 

interest shown on the questionnaire with and without the transient 

Indians whose presence or absence is unrecorded. The hypothesis to 

be tested in such cases would be whether or not a bias is introduced 

for the sample results as a function of the transient Indians, A simple 

means test can be established to test whether or not the mean values 

of the two distributions are significantly equal from a statistical Pers- 
ia 

pective. 

Another assumption in this analysis is that a labor cost of $3, 00 
per hour will be incurred in the interview and processing phases of the 
study. Also, a $5. 00 per hour charge will be incurred for the admin- 
istration and analysis stages. The costs of obtaining and analyzing 

4/ For s.n example of testing the difference between two means, 
see the following reference: William Mendenhall, Introduction to 

Statistics , (Belmont, California, Wadsworth Publishing Company, 1964), 
p. 151. 
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Indian statistics can be divided into the preliminary sample stage and 
the regular sample stage (which is an extension of the preliminary 
stage to include the total sample selected). The relevant costs can be 
approximated by the following functions: 



TC = PC SC 

PC := F + cN + dni + enj^ + bnj^ 

WSC = d nt - ni^ + e n^- - n^ + g (n^ ) 



Total costs of planning, obtaining and processing the 
sample results for a given reservation. 

Costs incurred in the preliminaT'y sample stage. 

Costs incurred to extend the preliminary sample to 
include a sufficiently large number of units. , 

Fixed costs associated with such functions as admin- 
istration, training interviewers, elerical, etc. 

Variable cost rate per element associated with dealing 
with the population elements in the preliminary stage, 
(i. e. , random number assignment and determining a 
classification strategy if required). 

N = Population size. 

d = Variable cost rate per element of traveling between 
sample elements, 
nj^ = Preliminary sample size. 

e = Variable cost rate per element of interviewing sample 
elements and reviewing the results, 
b = Variable cost rate per element of dealing with the 
preliminary sample elements to estimate the mean 
income and standard deviation for the determination 
of an appropriate sample size. 
n|; = Sample size selected from the preliminary study to 

achieve sufficiently precise results, 
g = Variable cost of processing and analyzing the statistical 
results recorded on the questionnaire during the inter- 
views. 



Where: 

TC = 
PC 

SC = 
F 



W In certain methods, the preliminary sample may not be 
fully extended in the regular sampling stage because of the character- 
istics of a given technique so the cost functions must be altered accord- 



In order to evaluate each of the four sampling plans, the follow- 



ing organization is used for each method: (1) basic nature; (2) sample 

size determination; (3) sample program; (4) simulated sample results. 

SIMPLE RANDOM SAMPLING 

Overview - The most basic form of sampling is simple random 
sampling, which means that each element (eligible Indian) in the popula- 
tion has an equal chance of being selected for the sample. No attempt 
is made to separate the population prior to the sample selection phase. 
Each element in the population is assigned an identification number, a 
proper sample size is selected, and a table of random digits is used to 
determine the units to be included in the sample. The interviewer than 
will visit ea.ch of the Indians selected for the sample and ask the various 
questions recorded on the questionnaire shown in Appendix A. The two 
major estixmators used in this study are the sam.ple mean (an estimate 
of "how much") and the sample proportion (an estimate of "how many") 
which are shown in Appendix B, along with their respective standard 
errors. The sample mean concept is used exclusively to determine 
an appropriate sample size in this study. 

The major advantage of simple random sampling is its simplicity. 
Also, it is beneficial to consider its basic characteristics, since they 
provide the foundation for other sampling methods. The population does 
not have to be divided in any way before the sample is selected so a great 
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deal o£ sophisticated analysis of the population characteristics is 
unnecessary. When the variation among the various Indian incomes 
and other statistics of interest is narrow, simple random samp] ’ j 
may provide a sufficiently reliable estimator within acceptable cost 
boundaries and produce results which can be controlled within probabil- 
istic limits. However, an analysis of the standard error with simple 
random sampling shown in Appendix B reveals its major disadvantage. 

If a great deal of variance between income levels exists, a sample size 
which is so large it is prohibitively costly may be required. In such 
cases, the sampler may be able to achieve more precision per unit of 
cost with another sampling method. Or geographical dispersion may 
be so wide that unacceptable costs will be required, to achieve satis- 
factory sample results. In this case, a cluster sampling approach will 
provide lower costs, although the degree of precision may diminish, 
so this method must be evaluated carefully to achieve the proper pre- 
cision-cost tradeoff. 

Sample size selection - The standard error should be used to 
estimate the sample size required for a desired degree of precision. 

Two approaches are possible to achieve this objective: (1) The standard 

error or the average individual income can be used to determine an 
appropriate sample size; (2) The standard error or the proportion for 
individual income levels can be used for this purpose. As was mentioned 
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earlier, since so inauy of the measures of interest are either formally 
or informally related to the income level r ‘ !:he various Indians, we 
r ;ccmmend chat either of the standard errors involved with income will 
provide an effective strategy for the selection of a specific sample size. 
The resulting sample size can then be used for a given reservation 
with the assumption that it will provide satisfactory statistical results 
for all of the measures of interest on the questionnaire. An alternative 
to this procedure would be to use the same approach for several impor- 
tant estimates listed in the questionnaire and choose a sample large 
enough that each of these characteristics is estimated with sufficient 
precision. The first approach, which we en ploy in this study, assumes 
that by controlling the standard error of the income measure, a reliable 
control of the total sampling error is achieved. Because of its effec- 
tiveness, we consider only the use of the standard error of the mean 
individual income estimate for the determination of an appropriate 
sample size in this study. 

Two major decisions must be made in ‘>rdsr to determine an 
appropriate sample size: (1) First, we must determine the amount of 

error we can tolerate for the income estim-..,<i of interest; (2) Second, 
we must determine the confidence we must have that the estimate will 
result within the acceptable error limits. The first factor defines the 
desired precision of the estimate and the second designates the reliability 
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of a^.i-ieving the precision or lective . We can summarize our dual 
concern for the precision-r-:. _,.ability combination by looking at the 
question of, "What is the p : -ability that the mean individual Indian 
income will be accurate within a given range. " This analysis can be 
expressed as follows: 

A =: kv 



Whe re : 

A = Precision required for the income estimate. 

k = The number of standard deviations expressing the 

confidence the sampler must have in the estimator. 

The number 2 often is used to approximate a 95 
percent confidence interval. Other possibilities are 
shown in Appendix D, With k = 2, we know that 95 

percent of the time our estimate will be within the 
acceptable range, A. 

V = The standard error defined as the population standard 
deviation (amount of variance) divided by the square 
root of the sample size, n. 

As was mentioned above, we consider only the use of the standard 
error of the individual income average for sample size selection purposes. 
Essentially, the same analysis is valid for the use of the standard error 
of proportions. A preliminary study should be performed to obtain a 
reliable estimate of the population standard deviation of individual 
Indian income. A sample of size 30 would normally be appropriate for 
this objective. This estimate can then be inserted into the following 
formula to determine the appropriate sample size which will allow an 
acceptable error and a satisfactory confidence level: 
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n 



^ Jc^NS^ 

+ k2s2 



Whe re : 

^ ~ The number of standard deviations necessary to achieve 

a desired confidence interval for reliability. 

N = Population size. 

A = Range of error allowed or precision (for example a range 
of $Z00) for the average income estimate. 

S - The population standard deviation which is normally 

estimated by the sample variance, s (See Appendix B). 



Once the appropriate sample size is determined, the sampler can 
draw the sample, estimate the measures of interest, and be assured that 
the error for the mean estimate of individual Indian income will be con- 
strained within acceptable limits. 

Sample program - The following steps should be taken to obtain 
a reliable simple random sample: 

1. Identifv the population elements consisting of all Indians who 
are sixteen years or older from the reservation rolls. 

2. Assign sequential numbers to the population elements. 

3. Obtain a table of random digits from a statistics textbook 
or the reference listed below. Dete rmine the number of 
digits required to represent the population element numbers 
and their range. For example, in the simulated income 
information in Appendix C, the numbers 001 to 000 would 
be used. 

4. Decide on a route to use in the table and select a starting 
point randomly. Select 30 random digits from the table 
and match them with the Indian population numbers. The 

30 individual Indian incomes represent a preliminary sample 



Acheson J. Duncan, Quality Control and Industrial Statistics. 
(3rd Edition, Homewood, Illincis, Richard D. IrwinV 1965), p. 938 . 
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which will provide an estimate of the population mean and 
the population standard deviation which, in turn, will be 
used to determine an appropriate sample size from the 
standard error with the formula above. 

5. Travel to and interview the 30 Indians selected for the pre- 
liminary sample. Uae the questionnaire shown in Appendix 
A to obtain information concerrdng each of the pertinent 
que stions, including the individual income levels. 

6. List the 30 sample incomes. Compute the sample mean 
and standard deviation for the individual incomes. Use 
the latter as an estimate for the population standard 
deviation. 

7. Choose an acceptable range of error for thi mean, individual 
income estimate. For example, an error of + $50, + $100, 

+ $200, + $300, etc. might be selected. Also, an acceptable 
percentage of the time this error will occur must be chosen. 
Normally, a confidence interval of 95 percent is used for 
this objective. As the precision is increased by narrowing 
the acceptable error, the sample size must be increased. 

8. Use the formula listed earlier to determine an appropriate 
sample size of n elements. Extend the preliminary sample 
of 30 to a sufficiently large number to achieve the required 
sample size of n elements through the use of a table of 
random digits. 

9. Travel to and interview all Indians chosen for the sample to 
obtain the information listed on the questionnaire. 

10. Tabulate the results of the interviews to quantify the statistical 
measures of interest for the questionnaire. 

11. Compute the new sample mean individual income and the re- 
lated standard error to evaluate the relevancy of the actual 
precision and reliability limits incurred. 

Simulat ed Sample Re sults - In the preliminary stage, a sample 
of 30 individual incomes was selected from the hypothetical population. 
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These sample results are shown in Appendix E. The sample mean 
income was $1, 514, with a standard d(3viation of $2, 452. It is obvious 
that a significant amount of income variation and skewness exists in 
the simulated population- -characteristics which appear likely for actual 
Indian reservations. 

Sample sizes required to achieve specific precision objectives 
under varying values for the relevant factors are shown in Appendix F. 
The four factors which must be defined to determine an appropriate 
sample size are; (1) N, the population size; (2) A, the desired pre- 
cision; (3) k, the desired confidence level; and (4) v, the standard 
error. The analysis in Appendix F is based on a confidence level of 
. 95 and the standard deviation estimated from the preliminary sample. 
Different sample sizes result from varying assumptions concerning N 
and A. It is obvious from the results of the simulated sampling per- 
formance that simple random sampling will be a costly method because 
of the significant dispersion in the Indian income population. If this 
situation exists in a real-life case, the costs of simple random sampling 
required to achieve a satisfactory precision level may be prohibitive. 

For example, for a reservation of 1, 000 eligible Indians, a sample 
size of 706 elements would be required to insure a sample individual 
income average within $100 of the true value at the . 95 confidence 
level with the standard deviation c ;-his case. If we want to be within 
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$50 of the true value, we would have to practically enumerate the entire 
population, since a sample of 905 elements would be required. Even if 
we relax the precision requirement and accept the estimator within $200 
of the true value 95 percent of the time, a sample of 37 6 elements would 
be required. Other combinations are shown in Appendix F for illustrative 
purposes. While desired precision levels can be attained in this case 
with simple random sampling, the costs will be high. 

The costs shown in Table A are assumed as being realistic for 
determining the cost of simple random sampling with the hypothetical 
Indian income population. These costs represent the parameters of the 
cost functions defined earlier. In actual applications, the specific costs 
of performing the various functions in the sample survey should be 
evaluated carefully. It also should be noted that the cost rates involved 
may change with different sampling plans, as well as economic circum- 
stances. 



Table 6 

A ssumed Costs With Simple handom Sampling 

F = $100 

b = $.20 

c = $. 01 

d = $1.00 

e = $3. 00 

g = $.80 
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We assume that a precision level of the 3amp.^.: • ?ean individual 
income estimate within $200 of the time value is acceptable at the . 95 
confidence level in simple random sampling. Consequently, a sample 
size of 376 Indians would be required. Sfnee we have 30 in the prelim- 
inary sample, an additional 346 elements are required. The costs 
involved with the total survey using simple random sampling then will 
be approximated as follows: 

PC = $100 + . 01 (1000) 1 (30) + 3 (30) + , 20 (30) 

$236 

SC = 1 (346) + 3 (346) + . 30 (376) 

= $1,684.30 

TC = $1, 920. 80 

Thus, the total cost in this case of interviewing 376 Indians 
selected with simple random sampling at the desired precision level 
will be approximately $1, 920. 80. If a precision of $100 for the income 
estimate is required, the same cost function would produce approximate 
costs of $3, 504. 80. The economic feasibility of these cost levels must 
be evaluated and compared with that of other sampling plans. After the 
additional elements have been selected randomly to extend the prelimin- 
ary sample results to an acceptable sample size, the Indians involved 
would be interviewed to ask the questions listed on the questionnaire. 
These statistics then would be processed to describe Indian life based 
upon a sample which insures a satisfactory precision level concerning 
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their income performance. Using the final statistical results of Appendix 
E, we are assured that the true mean individual Indian income will be 
within the range $1, 229 to $1, 7 01 approximately 95 percent of the time. 
So, with simple random sampling, the potential sampling error is sig- 
nificant. 



STRATIFIED SIMPLE RANDOM SAMPLING 
When the Indian population has a great deal of variability between 
the income levels of the various Indians (this appears likely in most 
cases), a large sample is required to attain a tolerable precision level 
with simple random sampling and other methods. The previous discussion 
of simple random sampling illustrates this problem. By reducing the 
variance, the standard error decreases And higher efficiency (precision 
per cost unit) can be achieved. The effe of extreme values is offset 
to a large degree with such action. In order to do so, the objective is 
to stratify the total population into homogeneous groups. Since individual 
income is our major statistic of interest for survey design purposes, 
we would want to group the Indians in such a way that individuals with 
similar income levels are in the same strata. The total variability of 
income levels is reduced g.^eatly if this is achieved since the main 
variance comes from a between group's source because of the homo- 
geneity within the strata. Consequently, an effective precision-cost 
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trade-off can be achieved since, a smaller sample size is required to 
attain a particular precision objective than with simple random sampling. 
We still have a probabilistic sample as each element has an equal chance 
of being selected for the sample. 

In order to achieve a significant amount of income homogeneity 
within the strata, we need to have some reasonable basis with which to 
classify the population. However, we do not have information about 
income until the sampling is accomplished and even then we do not 
know the income levels of all Indians in the population.. Thus, we need 
to obtain information concerning a known measure which is correlated 
highly with the income performance. This measure can then be used 
to stratify the population in an appropriate manner. One suggestion for 
accomplishing this objective is to use vehicle registration statistics. 

By assuming a high correlation between the age of the vehicle owned 
by a given Indian and his income level, the registration statistics can be 
used to classify the population into meaningful strata. If no vehicle is 
owned, a zero or low income level may be assumed. 

In an illustration of sti ified simple random sampling with the 
hypothetical Indian population of Appendix C, we know the various income 
levels so we can stratify based on our analysis of the income values. 

In a real-life ca.se, however, we would have to make certain assumptions 
about the relationships betw^een a correlated variable, such as vehicle 
registration and income levels. An example of this approach is listed 
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in Table B. Since perfect correlation is not required to recei /e signi- 
ficant benefits from stratified sampling, the procedure offers potential 
advantage s . 



Table 7 



Assumed Correlation Beb^' en 
Vehicle Registration and Indian Income 



Vehicle Age (Years) 



Income (Dollars) 



None owned or greater than 10 
6-10 
4-5 
2-3 

Less than 2 



$0-500 
501-100C 
1001-3000 
3001 -6000 
Greater than 6000 



Sample Size Selection - The selection of a sample size that 
will provide the desired precision results with stratified simple random 
sampling is based upon the same approach as that illustrated earlier 
for simple randor'i sampling. Once the various strata are identified, 
the number of Indians that must be selected to achieve the 'desired 
interview results is based on the same basic formula: 

A = kv 



Where: 

A = The desired precision of the estimate in terms of the 
deviation allowed. 

k = The number of standard deviations expre sing the 
confidence the sampler must have in the estimate. 

V = The standard error which is defined in Appendix B for 
stratified simple random sampling. 



Once again, in order to be able to select an appropriate sample 



size, \v'G need to have a reliable estimate of standard deviation. But 
since the population is stratified, we need an estimate of the standard 
deviation from each of the strata. A preliminary sample of 30 Indian? 
from each of the strata. A preliminary sample of 30 Indians from each 
stratum should be obtained for this purpose. Then, the following 
formula can be used to select a sample size which offers an acceptable 
precision level: 

L 

(Nj^Sh) 

n^ = 

+ (NhSh 

k 

Where: 

^h Appropriate sample size for the hth stratum. 

= Population size for the hth stratum, 

Sh = Standard deviation for the hth stratum. 

N = Total population size. 

k = Number of standard deviations required for an 
acceptable confidence level. 

A = Des ired precision describing the range within which 
the estimate must fall. 

A = Number of strata. 

Once an appropriate sample size is determined for each stratum, 
the samolsr w:.ll draw the sample, estimate all measures of interest 
from the questionnaire, and be insured that the error for the estimate 
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of the mean individual Indian income will be within acceptable limits. 

If the stratification is performed effectively, a smaller sample will be 
required to achieve comparable precision tha.n that obtained with simple 
randoim sampling. 



Sample Program 

The following steps should be taken to obtain a reliable stratified 
simple random sample : 

1. Identify the population elements consisting of all Indians 
who are sixteen years or older from the reservation roll. 

2. Assign sequential numbers to the population elements. 

3. Stratify the population into several classes with the objective 
of grouping simila.r income levels together. Use some 
measure, such as vehicle registration for stratification 
purposes. Four or five strata normally will be sufficient 

to achieve the desired results in this type of study. The 
population element should be matched with vehicle regis- 
tration statistics according to decision rules, such as 
those shown in Table B. 

4. Randomly select 30 elements from each stratum by using 
the same random digit procedure discussed earlier for 
simple random sampling. If less than 30 elements are 
included in a stratum, use all of them. 

5. Travel to and interview each of the Indians selected for the 
stratified sample. Obtain information related to each 
question listed on the questionnaire, including the measure 
of his individual income. 

6. Compute the total sample mean individual income and the 
standard deviation for each stratum. Use the sample 
standard deviation as an estimate of the population standard 
deviation for each stratum. 
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7. Choose an acceptable range of error for the mean inJividual 
mcome estimate (precision). For example, as beff)re an 
error of $50, +^$100, i- $200, etc. might be selected. 

Also, an acceptable percentc^ge of the time this error will 
occur must be chosen (reliability). 

8. Use . e formula given earlier to determine the appropriate 
sample size for each stratum. Extend the preliminary 
sample of 30 per stratum, if necessary, to achieve the 
required sample of n items where n is equal to the sum of 
the elements drawn from the strata. 

9. Interview all Indians chosen for the sample to obtain the 
inforimition listed on the questionnaire. 

10. Tabulate me resnJxs of the interviews to quantify the 
statistical measures of interest from the questionnaire. 

11. Compute the new sample mean individual income and re- 
lated standard error to determine the relevancy of the 
actual precision and reliability incurred. 

Simulated Sample Results - The hypothetical Indian population 
was strat:u;ed as shown in Table C to reduce the effect of the skewed 
income distribution and related variance concerning the sampling process. 
A preliminary sample of 30 elements were selected randomly from each 
of the four strata. The statistical results of the stratified sample 
process are summarized in Appendix O, Note that the standard error 
of the average income estimate has been reduced significantly, thus 
suggesting that a much smaller sample size will b^- required than with 
simple random sampling to achieve comparable results. Usii g the 
formula for sample size selecC^^n with stratified simple random sampling, 
we confirm this preniise. In this case, the preliminary sample itself 



will produce a sufficient number of elements for a precision require- 
ment of $100 or more. If we want to narrow the tolerable range to 
$50, we need only extend the preliminary sample to a total of 191 
elements distributed as shown in Table D. 

We should keep in mind that even though a sampling error of 
$50 on the average seems sir.'all, this produces a potential error of 
$50, 000 for the total income of a reservation as small as the one with 
which we are dealing--given the assumed income data. Or, looking at 
another way, with a :ean individual income of about $1, 500, $50 

represents approximately 3. 3 percent of the average income. 



Stratum 

1 

2 

3 

4 



Table 8 

Strata Established 
for 1000 Simulated Individual 
Indian Incomes 



Income Range 


# of Elements 




$0-999 


606 


0001 -0606 


1000-2999 


235 


0607-0842 


3000-6999 


120 


0842-0962 


7000 and above 


39 






1000 






50 
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Total 



Table 9 



Stratum 


Sample Size Per Stratum 
Sample Size 


Sample Size 




D = 50 


D = 100 or more 


1 


48 


30 


2 


63 


30 


3 


56 


30 


4 


24 


30 



Total 191 120 



A precision level of $100 would be $100, 000 for the income total 3 r 6. 6 
percent of the average income. we use the precision limits of $100 
as a comparative base, we see that the same results can be achieved 
with a sample of IZO elements, using stratified simple random sampling 
that would require 706 units with simple random sampling. Thus, as 
long as the costs of preparing the population for stratified sampling are 
not exceedingly high, the stratified method offers substantial cost 
benefits with acceptable precision limits. We use the assumed costs 
shown in Table E for an analysis of the costs incurred to obtain a 
stratified random sample in this case. Once again, since we are deal- 
ing presently with a hypothetical case, these costs are based upon 
expected conditions which should generally prevail. In a specific real- 
life case, the costs used for analysis purposes should be based on the 
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prevailing conditions. 



Table 10 



Assumed Costs With Stratified Simple Random Sampling 

F = $125 

b = $. 30 

c = $, 50 

d = $1.00 

e = $3. 00 

g = $.80 



Some of the individi’.al costs items have chajiged from those used for 
simple random sampling !. ecause of the basic nature of stratified simple 
random sampling. We would expect the fixed costs of the preliminary- 
stage to increase since the decision makers -will have to spend more 
time planning an appropriate stratification technique. Also, more 
processing time vAll be required for computing the mean and standard 
deviation, since the stratified sampling formulas are more complicated. 
Since additional information -will be required to achieve the stratification 
objective (such as vehicle registration statistics), the variable cost of 
dealing with the population in the preliminary stage will increase suo- 
stantially. In order to achieve a precision level of $100 for the mean 
individual Indian income, the following approximate costs would be 
incurred in thi case. 



PC = $125 + .50(1000) + 1(120) + 3 (120) + .3(120) 

= $1141.00 

SC = $.80 (120) 

96 . 00 

TC = $1237. Q O 
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Thus, in a situation like this, where a large income variance 



exists on a given reservation, substantial savings are offered through 
the use of stratified simple random sampling. Since extreme differences 
between income levels on a reservation are likely based on previous 
studies, we strongly recommend stratified simple random sampling as 
a reliable way to overcome the problem. By incurring a higher cost 
to prepare the population for sampling through effective stratification, 
a much smaller sample size will be necessary in cases such as the one 
illustrated and the precision-cost trade off is much more favorable than 
that of many other methods. Even in cases where the preliminary sample 
has to be extended to achieve the desired precision, substantial cost 
savings should be possible when the wide \-ariation of income is a problem. 
We can evaluate the validity of the final estimate for the mean individual 
Indian income as follows: The true value will be within the range of 

about $1, 343 to $1, 487 at the . 95 confidence level (or saying the same 
thing, 95 percent of the time). Thus, the precision involved is quite 
high with stratified simple random sampling. 

SINGLE STAGE CLUSTER SAMPLING 

Cluster sampling is involved v^hen the Indian reservation is 
divided into areal units --each of which consist of several Indians 
(actually, Indian domiciles may be used). In certain cases, single 
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stage cluster sampling may provide an effective method for applying 
the Indian questionnaire. ThB major benefit of cluster sampling is that 
it is useful where a wide geographical area is involved, since the cost 
of traveling between domiciles to obtain the sample size required for 
one of the other methods described in this section may be prohibitive. 

A secondary benefit of cluster sampling applies to a situation where 
the population of a specific reservation is not well defined on the popu- 
lation rolls and the interviewer is forced to identify a representative 
setment of the population so the survey can be taken. By restrmting 
his analysis to localized geographical areas (clusters)., the interviewer 
may be able to avoid attempting to cover an entire population. 

Cluster sampling must be used with caution, however, to insure 
that a satisfactory precision level is attained. This problem can be 
'.readily seen from an analysis of the basic characteristics of cluster 
sampling. Instead of selecting the sample from all segments of the 
population, the final sample consideration in cluster sampling is directed 
toward a limited part of the population. We still are dealing with a 
probability sample, since all elements have an equal chance of being 
chosen, but the final selection phase is concerned with a limited 
coverage. In the preliminary stage, the population is divided into 
groups (clusters) of Indians based upon the geographic conditions 
involved. These groups (clusters) of Indians located close together 
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serve as primary sampling units, Recall that in stratified sampling, 
the objective was to group the Indians in a way that would achieve a great 
deal of homogeneity within the strata, thus reducing the variability of 
income levels and necessary sample sizes. In cluster sampling, we 
usually do not expect similar precision savings, since the geographical 
conditions primarily dictate the group structures. Within the geograph- 
ical constraints, the idea is to attempt to attain as much heterogeneity 
as possible within each of the clusters so they serve as miniature popu- 
lations. If an extremely large amount of heterogeneity is achieved, it 
is possible that nominal precision gains compared with other methods 
may result. But, even though precision gains norma.lly are not achieved, 
reduced travel costs may offer an attractive precision per cost unit 
opportunity; since the sample elements are so close together, the sampler 
cam afford to select a larger sample than other methods would allow. 

Once the clusters have been identified, the sampling process 
begins by randomly selecting a sufficient number of clusters that will 
insure that the desired precision standards are met. Then, each 
Indian located in the selected clusters is interviewed so the; sampler 
actually enumerates all elements of the chosen clusters. All clusters, 
therefore, are not chosen for the sample and all of the sampling error 
is related to this dimension. The cluster method also can be extended 
to a two stage approach^ whe re we not only sample to choose the clusters, 
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selected to determine the final 



but also sample within rhe clusters 
group of Indians to be interviewed. We do not cover this case in this 
study. If each cluster contains Indians with significant differences in 
their individual income levels, it will serve as a miniature population 
and the method may produce satisfactory precision results. vSince the 
travel cost of drawing the sample with this method is not related to 
sample size to the degree other methods are, the precision per unit 
of cost may be attractive. The two main factors to consider then in 
evaluating the feasibility of cluster sampling for a given Indian reserva- 
tion are: (1) the significance of the travel costs; (2) the degree of 

heterogeneity which can be achieved with the various clusters so the 
sample size does not have to be extremely large. 



Sample Size Selection 

The use of the cluster sampling standard error for the determin- 
ation of a satisfactory sample size is much more complicated and less 
effective than the processes described for the first two methods. However, 
since our objective with single stage cluster sampling normally is not 
to achieve precision gains, a useful approach for determining the selec- 
tion of an appropriate sample size is to use the standard error for simple 
random sampling rather than that of cluster sampling. The standard 
error for simple random rampling is less complicated to work with 
and provides an effective way to determine a satisfactory sample size 
with certain limiting assumptions. 
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Basically, we assume that approximately the same degree of 



precision incurred with simple random sampling will prevail for cluster 
sampling. As long as a reasonable amount of heterogeneity is achieved 
v/ithin the clusters, this assumption is realistic. Normally, the sampling 
error with cluster sampling will be somewhat higher when compared with 
that of simple random sa.mpling for a given sample size, but if adequate 
heterogeneity is attained, the difference will not be significant. As was 
mentioned earlier, modest precision gains may even result if the heter- 
ogeneity is sufficiently high. We can use the same approach as that 
described earlier to determine an appropriate sample size with simple 
random sampling. A preliminary sample is taken to estimate the popu- 
lation standard deviation and this statistic is combined with the desired 
precision limit and reliability to calc late an appropriate sample size. 

The precision results can be evaluated to see if additior sample infor- 
mation should be obtained to further reduce the standar ctot. Once 

the total elements have been s(dected, this number mu; be translated 
into clusters, since the latter constitute the sample. ssuming that 
the same number of elements have been assigned to each cluster, we 
need only divide the total number of elements by the cluster size to 
determine the number of clusters which must be selected. For example, 
if the sample size formula determines that 300 elements must be drawn 
and each cluster contains 15 elements, 20 clusters would be enumerated 



for the sample results. 



Sample Program 



The following steps should be taken to obtain a 



reliable single stage cluster sample: 

1. Identify the population elements consisting of all Indians 
who are sixteen years or older from the population roll. 

2. Match the location of the Indians in No. 1 with a map of 
the reservation, 

3. Based upon an analysis of the geographical conditions 

shown on the map (for example, villages in existence, 
spread between domiciles, accessibility, etc. ) and an 
approximation of the income heterogeneity involved, 
divide the total reservation into several of 

Indian domiciles. An equal number (or at least approx- 
imately so) of eligible Indians should be included in each 
cluster. 

4. Assign sequential numbers to the population elements and 
separa^te numbers to the clusters, 

5. Obtain a table of random digits and select a simple randoin 
sample of 30 Indians, based upon the same procedures 
discussed in an earlier section, 

6. Travel to the location of the Indians selected and interview 
them with the questionnaire. 

7. From the preliminary sample determined in No, 5, calcu- 
late the sample mean individual income and related standard 
deviation. Use the sample standard deviation tc deterrnme 
an appropriate sample size with simple random sampling. 

8. Choose an acceptable precision range and reliability iirnif. 
just as was done with simple random samM-Ungc Use the 
formula A =; kv to calculate an a.ppropriata sample size 
of n elements. 

9. Divide the sample size in No. 8 by the raLvn.' of elements 
included in each cluster. The result is f‘:e \ umber of 
clusters which must be selected. 



10. Use the table of randoTn digits to select the number of 
clusters required for the sample information. 

11. Interview all Indians located in the clusters chosen. 

12: TabuLate the results of the interviews* to quantify the 

statistical measures of interest fiom the questionnaire. 

13. Compute the new sample mean individual incon. . and the 

related sample standard error to determine the relevancy 
of the actual precision ?md reliability incurred. 

Simulated Sample Resul ts 

In order to illusti-ate the application of cluster sampling, we 
assume thax the hypothetical Indian population of 1, 000 elements can be 
divided into 50 clusters of 20 Indians, based upon an assessment of the 
prc v ailing geographical conditions and objective of heterogeneity within 
the clusters. The specific characteristics of a given reservation must 
be considered carefully in a real-life case to achieve as much heter- 
ogeneity of individual Indian income within the clusters as the geograph- 
ical conditions will allow. But the primary objective is to minimize 
the travel costs required between the clusters so precision consider- 
ations are secondary to the cost factors. A map of the reservation 
should be used to effectively group the eligible Indians into realistic 
clusters. The reservation rolls should be matched with the map for 
this purpose. Local inquiry may be required to supplement, check, 
and update the information recorded on the map and population roll. 
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Using the rule suggested earlier for sample size, we find that 
19 c lusters must be chosen to receive the 376 elements recjuired for 3-n 
income estimate within $200 of the true value 95 percent of the time. 

The 19 clusters give us 380 sample elements to work with. The pre- 
j_fYiinary results would be obtained for the cluster sampling method 
with simple random sampling to identify the significance of the income 
variance as was discussed earlier. Then, the proper number of clusters 
would be selected to obtain the required sample size. The sample re- 
sults with cluster sampling are shown in Appendix H. The sample mean 
income was $1,464, with a standard error of $29-43. The approximate 
costs of obtaining the cluster sample information are based upon the 
data shown in Table 11. 

I Table 11 

Assumed Costs With Single Stage Cluster Sampling 

F = $130 

d = $. 20 

e =: $. 30 

d = $. 30 for SC; $1. 00 for PC 

e = $3. 00 

g = $. 80 

Once again, many of the individual cost items have changed 
■when comoared "with the first t^vo methods because of the unique char- 
acteristics of cluster sampling. The fixed costs will increase slightly 
from those of stratified sampling, since we must be concerned with 
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geographical factors, as well as a classification strategy which achieves 
some heterogeneity. The variable cost of processing the preliminary 
sample in order to determine an appropriate sample size will be equal 
to that incurred for simple random sampling. The travel cost of obtain- 
ing the regular sample will be lower than that of other methods, since 
we are dealing with clusters. An additional cost is expected with cluster 
sampling because of the fact that the preliminary sample cannot be 
extended to the regular sample, since the latter consists of clusters. 
Some of the preliminary sample units no doubt will be located in the 
clusters chosen, but we will not know how many there will be until 
the clusters are chosen. 



PC = 130 + . 30 (1,000) + 1 (30) + . 20 (30) 
- 556 

7^/ SC = .30(380) + 3(380) + .80(380) 

= 1,558 

7/ TC = $2, 114 



Therefore, the precision per unit of cost with cluster sampling 
would be measured by a small standard error of approximately $29. 43 
at a cost of about $2, 114. Consequently, we have a better precision 
per 'unit of cost here than we had with simple random sampling when the 
standard error was $118 and the cost was about $1, 921. So, for a 



_?/ It is likely that some of the preliminary sample elements 
also will be included in the regular sample, so the cost function may 
be overstated. 
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slightly higher cost, cluster sampling produces a lower standard error 
in this case. This outcome resulted from the high degree of hetero- 
geneity with the clusters selected for the sample and will not always 
occur, since in the typical case, we use cluster sampling for cost 
benefits but do not expect precision gains. We can analyze the actual 
precision that results by noting that we can expect the true value of the 
mean individual Indian income to be within the range of about $1, 404 to 
$1, 524, approximately 95 percent of the time. So- again we see that 
the error range is narrow in this case. 

SYSTEMATIC SAMPLING 

The final method discussed in this section for conducting the 
Indian interviews is systematic sampling. The chief advantage of this 
method is its simplicity to accomplish. A decision rule is built in tr 
inform the sampler as to the elements he must select rather than re- 
quiring him to depend on a table of random digits. Also, in certain 
conditions, the standard error with systematic sampling is less than 
that incurred with alternative methods --although there is no guarantee 
of this outcome. The major disadvantage of systematic sampling is 
the danger of periodicity, which means that elements with essentially 
the same characteristics are selected regularly so a bias is introduced. 
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This potential problem is obvious when we analyze the major character- 
i;^tics of systematic sampling. 

The basic idea is that every kth Indian located on the reservation 
roll is i: elected for the sample. Once the popula.tion of eligible Indians 
is identified, an appropriate sample size n is selected and k is deter- 
mined as — . A popular approach is to begin the sampling by randomly 
selecting an Indian in the population listed between 1 and k. This pro- 
vides a starting number and every kth Indian then is selected to con- 
stitute an effective sample, r'or example, in the hypothetical population 
of 1, 000 Indian incomes, we might desire a sample of ZOO, so k is 5. 

We would randomly select a number between 1 and 5 to begin the 
sampling. Assume we select a 3. The sample would include Indians 
listed in the 3rd, 8th, 13th, 18th, etc. positions. The basic problem 
relates to the fact that the data involved may be periodic in relation to 
the order of the listing. The selected elements may be alike and thus 
would not be representative of the total population. If essentially the 
same value is repeated over the sample selected, a significant standard 
error is introduced. An effective way to minimize the possibility of 
periodicity is to randomize the population roll used to draw the sample. 
The alphabetized list maintained on the reservation roll v/ould be 
changed to a randomized order through the use of a table of random 
digits. Then the systematic sample would be drav/n after k was selected. 



A counte r -balancing disadvantage of randomizing the population 
is that precision gains are incurred with systematic sampling when there 
is a high correlation between the income level of adjacent Indians located 
on the population list. The serial, correlations involved should decrease 
as the interval between the individual Indians increases. When the serial 
correlations of near-by Indians are large with decreasing correlations 
between the units farther apart, significant precision gains may result 
with systematic sampling. Therefore, the sampler can organize the 
population Indian income data in a way that achieves the serial correla- 
tion objective and, at the same time, avoids the threat of periodicity, 
the randomization procedure should not be used. However, it is 
doubtful that the information requ.ired for such an effective organization 
will be available, since generating it is the primary objective of the 
sample process. T of systematic sampling in a situation 

like an Indian study is uxiij-nely to produce consistent precision gains and 
its results will be similar to those of simple random sampling. Since 
precision gains probably car jt be achieved intentionally, the major 
advantage of the method is its simplicity and structured rule for the 
selection of the sample elements. 

Sample Size - The standard error with systematic sampling is not 
amenable for the selection of an appropriate sample size. However, 
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since our objective with this method in a case, such as an Indian study, 
normally is not to achieve precision gains anyway, the standard error 
with simple random sampling can be used in a preliminary study to 
determine a satisfactory sample size. The approach described earlier 
for simple random sampling can be used to define an effective sample 
size by taking a preliminary survey of 30 Indians randomly, determining 
the sample standard deviation, and using the formula defined earlier for 
sample size selection. Once n is established, k also is defined by 
dividing N by n (Francional results are rounded to the nearest v/hole 
number). A random start between 1 and k initiates the sampling process 
and every kth Indian is selected for the sample. Consequently, we will 
achieve approximately the same precision-cost results as we would have 
with simple random sampling, but we have a systematic strategy to 
fo How. 

S ample Progiu^.m - The following steps should be taken to obtain 
reliable sample information with systematic sampling: 

1. Identify the population elements consisting of all Indians 
who are sixteen years or older from the reservation roll, 

2. Randomize the sequential order of the list in No, 1 to 
minimize the chance of pe riodicity. 

Randomly select a preliminary sample of 30 Indians from 
the new list. 
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3. 



4. Ti-avel to ajid interview the 30 Indians selected for the pre- 
liminary sample. Use the questionnaire shown in Appendix 
A to obtain information conceiving each of the pertinent 
questions, including the individual incoine levels. 

5. List the 30 sample incomes. Compute the sample mean 
and s-andard deviation for the individual incomes. Use the 
latter as an estimate for the population standard deviation. 

6. Choose an acceptable range of error for the mean individual 
income estimate. Also, determine a satisfactory reliability 
level--the percentage of the time this erroi" must occur. 

7. Use the formula listed earlier (including the precision, 
reliability, and estimate of the population standard deviation) 
to determine an appropriate sample size of n elements. 

8. Determine k, the sampling interval, by dividing the sample 

size into the population (_). 

n 

9. Randomly select a starting point between 1 and k. Then 
choose every kth Indian to constitute a total sample of n 
items. (Round any uneven number for k to the nearest 
digit). 

10. Travel to and interview all Indians selected for the sample 
of size n. (Hopefully, some of those selected already will 
have been interviewed in the preliminary stage - -although 
there is no guarantee of this). Obtain the pertinent infor- 
mation listed on the questionnaire. 

11. Tabulate the results of the interviews to quantify 
tical measures of interest from the questionnaire. 

IZ. Compute the new sample mean individual income and related 
standard error to evaluate the relevancy of the actual pre- 
cision and reliability limits incurred. 

Simulated S amp le Results - The same preliminary sample results 

generated with simple random sampling earlier (Appendix E) were used 

to determine an appropriate systematic sample. Precision and reliability 
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limits of $200 and 95 percent, respectively, were assigned lor the 
systematic sampling project. A sample of 376 elements were required 
to achieve these objectives, so k was set equal to 3 to provide approx- 
imately the required number of elements. A random selection of the 
number 2 established the starting point and every third number in the 
randomized version of the hypothetical India.n population was selected 
(2nd, 5th, 8th, 11th, 14th, etc. ). The sample of 333 elements produced 
a mean individual Indian income of $1, 448 and a standard error of approx- 
imately $125, as shown in Appendix I. The major advantage of this 
method obviously is its simplicity, since an automatic decision rule is 
built in to determine the elemer ;s required ^or the sample. 

The cost estimates with systematic 3ampling are shown in Table 
12. These costs are very similar to those incurred with simple random 
sampling, except for the variable costs o .eparing for the preliminary 
sample. 



Table 12 



Assumed Costs With Systematic Sampling 

F = $100 

b = $.20 

c = $. 05 

d = $1. 00 

e e $3.00 

g = $.80 
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Based upon these costs, the approximate costs with systematic 



sampling would be as follows: 



PC 

SC 

8/ TC 



$100 + .05(1000) + 1(30) + 3(30) + .2(30) 

276 

1 (333) + 3 (333) + . 80 (333) 

1, 298. 40 
1, 588. 40 



The costs with systematic sampling and the hypothetical Indian 
population are lower than they are with simple random sampling, but the 
standard error is aigher. However, a smaller sample has caused this 
lower cost because of the rounding process to compute k. Since the 
sample is smaller, we have not achieved the precision limits originally 
planned, but the effect of the difference will be small. We can rely on 
the fact that the true value for the mean individual income will be in the 
range cf $1, 178 to $1, 718 at the . 95 confidence level. Thus, we have 
controlled the standard error within reasonable limits, although they 
are wider than .with stratified and cluster sampling. 



SUMMARY 



Four basic sampling methods which will provide objective and 
reliable ways to select an adequate number of Indians from the popula- 
tion have been discussed in this section. The use of one of these sample 
designs eliminates the need to deal with the entire Indian population 
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involved but does so with known precision and reliability limits. A 
hypothetical Indian population was employed to illustrate the use and 
hypothetical cost o£ the various plans. 

Sampling with, replacement is assumed throughout the analysis 
because o£ the nature and objectives of the Indian interviews. The focus 
CO control the precision and reliability of the sample results is on the 
individual Indian incomes, since so many of the statistical measures of 
interest on the questionnaire are highly correlated with the individual 
income levels. By controlling the sampling error for the income estimate, 
the authors suggest the total errors of S3.mpling can be regulated effectively. 

Each of the sampling method discussed have certain advantages and 
disadvantages --primarily from a cost and precision perspective. As was 
mentioned earlier, we suggest strongly that stratified simple random 
sampling will offer significant gains for most Indian studies. The diverse 
income levels of a reservation are likely to create a significant amount 
of variation and skewness, which will demand a large sarr ’’e 
simple random sampling, cluster samplings and systematic sampling. 
Precision gains will be achieved if an effective system is designed to 
structure the total population into strata. A known statistic which is 
highly correlated with the individual Indian income levels can be used 
to achieve an effective stratification strategy. Vehicle registration 
statistics, while they may not be correlated perfectly with incomes. 

Once again, it is likely that some of the preliminary sample 
elements also will be included in the regular sample so the cost function 
may be overstated. 



should provide a sound basis for the stratification process. Since the 
total variance is reduced primarily with stratification to a between 
group's variance because of the homogeneity v/ithin each stratum, a 
smaller sample is necessary to achieve given precision objectives than 
that required with other methods. Consequently, the costs are reduced 
substantially. Once a reasonable sample size is determined, the approp- 
riate number of Indians are interviewed through the use of the questionnaire 
shown in Appendix A to investigate the relevant facts concerning Indian 
life. As long as one of the four methods discussed in this section is 
sele c ted and employed, we are assured that the statistical results will 
be based upon an objective foundation, subject to a controlled sampling 
error. 
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APPENDIX A 



Sample Questionnaire for Indian Demographic Studies 



Name 
Addre ss 



Sex 

Year of birth 

Code 

Date Interviewed 
Interviewer 
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ASK ALL RESPONDENTS 



1. Did you work last week? Yes_ No 

2. Did you work during the past i-nonth ? Yes No _ 

3. How many hours a- week do you usually work at your job? 



1. 


1-14 


2. 


15-29 


3. 


■30-34 


4. 


35-39 


5. 


40 


6. 


41 -48 


7. 


49-59 


8. 


60 or more 



4. If you usually work less than 35 hours per week, what is the reason 

1. Slack work 

2. Material shortage 

3. Plant or machine repair 

4. Could only find part time work 

5. Labor dispute 

6. Bad weather 

7. Illness 

8. Too busy with housework, school, business, personal 

9. Did not want full time work 

10. Full time work week under 35 hours 

1 1 . Other reasons 



5a. Do you usually work 



1. Year round 

2. Seasonal 

3. Irregular 

5b, If you work seasonal, do you usually v/ork in 

1. Spring 

2. Winter 

3. Fall 

4. Summer 
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6. What were you doing most of last month? 



1. Working 

2. At job but not working 

3. Looking for work 

4. Keeping house 

5. Going to school 

6. Not able to work 

7. Retired 

8. Other 

7. Is your present job located on the reservation? Yes No 

8. Where did you learn your present job? 

1. Taught by employer 

2. Government training program 

3. Armed Se rvice s 

4. Formal schooling 

5. Other 



Questions for Tuose Not Usually Working 



9. Have you looked for a job during the past month? No Yes 

If yes, full, part time, or both? 

10. If looking for work, what have you been doing in order to find work 

1 . BIA 

2. State Employment Service 

3. Private employment service 

4. Checked with employer 

5. Friends or relatives 

6. Placed or answered ads 

7 . Othe r 
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5. Health problems 

6. Personal problems 

7. Lack of transportation 

8. Do not want to leave reservation ____ 

9. Other 

12. If you are not looking for work, why? 

1. Believes no work is available 

2. Could not find work 

3. Lack of schooling, training, or experience 

4 Employer thinks too young or too old 

5. Cannot arrange for child care 

6. Do not want to leave reservation 

7. Family responsibilities 

8. In school 

9. Ill health 

10. Other 

13. When did you last work at a regular full or part time job? 

1. Within past 12 months 

2. Within last 2 years 

3. Betvveen 2 and 3 years 

4. 3-4 years 

5. 4-5 years 

6. 5 or more years 

7. Never v/orked 

14. Why did you leave your last job? 

1. Person^?.! reasons 

2. 111. health ~ 

3. Retirement 

4. Seasonal job completed 

5. Laid off 

6. Unsatisfactory work conditions 

7. Other 
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11. Why do you think you have had difficulty getting a job? 



1. No jobs available in line your work 

2. Age - too young or too old 

3. Lack of necessary skills or experience 

4. Lack of necessary education or training 

5. Health problems 

6. Personal problems 

7. Lack of transportation 

8. Do not want to leave reservation 

9. Other 



12. If you are not looking for work, why? 



1. Believes no work is available 

2. Could not find work 

3. Lack of schooling, training, or experience 

4. ii^mployer thinks too young or too old 

5. Cannot arrange for child care 

6. Do not want to leave reservation 

7. Family responsibilities 

8. In school 

9. Ill health ~ 

10. Other 



13. When did you last work at a regular full or part time job 

1. Within past 12 months 

2. Within last 2 years 

3. Between 2 and 3 years 

4. 3~4 years 

5. 4-5 years 

6. 5 or more years 

7. Never worked 



14. Why did you leave your last job? 



1. Personal reasons 

2. IP health 

3. Retirement 

4. Seasonal job completed 

Laid off 

6, Unsatisfactory work conditions 

7. Other 
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15. 



Describe your present job or your last job (within 5 years) 



Has not worked in 5 years (skip rest of section) 



16. What is the name of your present or last employer? 

17. What kind of business or industry is this? 

17a. What kind of work were you doing? 



18. Does this job involve working 



(che 


ck one or more) 


1. 


In an office 


2. 


Inside, but not in an office 


3. 


Outside 


4. 


Machinery (specify) 


5. 


As a helper 



SIC code 51-53 
DOT code 54-59 
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19- What other types of employment have you had in the past five years? 
Employer Occupation Code Where Learned^:- 



60-63 

64-67 

68-71 



‘■"'Place of Learning 



1 . 

2 . 

3. 

4. 

5. 



Employe r 

Government training program 
Armed Services 
Formal education 
Othe r 



20. Have you been trained for any type of work not listed above? 
1 no 0 yes (specify type of training and approximate 

date) 
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21. Do you hold a union card? 
1 ye s 0 no 
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O 

ERIC 



22. Where does your family usually purchase groceries? 74 

1 On the reservation 3 Half and half 

2 Off the reservation 4 Don't know 

22a. How does your family usually pay for groceries at the time 75 

of purchase; 



1 Cash 2 Credit 3 Don't know 



23. Where does your family usually purchase automobile repairs? 76 



1 On the reservation 

2 Off the reservation 

3 Half and half 



4 Don't know 

5 None 



7 7 




Zone 77-78 
Subs ample 79~ 



80 

8 ' 



Card 



23a. 



How does your family usually pay for automobile repairs? 



1 Cash 3 Other credit 

2 Oil company credit card 4 Don’t know 

24. Where does your family usually purchase clothing: 

1 On the reservation 3 Half and half 

2 Off the reservation 4 Don't know 

24a. How does your family usually pay for clothing? 

1 Cash 2 Credit " Don’t know 

25. Are you now enrolled in a sc ol r training piogram: 

1 yes specify 

0 no 

26. What is the highest grade of school you have completed? 



01 


04 


07 


10 


13 


16 


02 


05 


08 


11 


14 


17 or more 


03 


06 


09 


12 


15 





27. Do you have a high school diploma or a GED 

1 yes 0 no 

28. Do you have any 

College degrees 1 yes (specify) 

0 no 



Technical or vocational school certificates 



29. 



What language is 



1 

0 



spoken most frequently in 
2 English 



yes (specify) 
no 

your home ? 
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10 



11 



2-13 



14 



15 



16 

17 



1 An Indian language 



3 Othe r 



30. 



Do you speak English? 



18 



1 yes 0 no 

31. Do you read English? 19 

1 ye s 0 no 

32. What is your marital status? 20 

1 Married 2 Divorced 5 K — , m ma: . -ed 

3 Widowed 4 Separated 

33. How many children have you? 21 

1 One 4 F our 7 Seven 

2 Two 5 Five 8 More than 7 

3 Three 6 Six 9 None 

34. How old are your children? 

22 

35. What were the sources of income received by you in the last 

twelve months ? 



1 ye s 0 no 
1 ye s 0 no 
1 yes 0 no 

1 yes 0 no 

1 ye s 0 no 



1 

1 



yes 0 
ye s 0 



no 



Gifts from children or relatives or churches . 23 



F rom sale of handle rafts , 24 

Self-employed income (business, farm trade, 
or professional enterprise) individual or 

as a partner 25 

Earnings from the ownership of a farm or 
ranch, a craft shop or retail store, a 



fishing, tourist, or guide -service business.. 26 
Earnings from a trade (carpenter, well- 
driller, paying contractor, auto mechanic, 
barber , bookeeper, ranch manager, dres s - 

maker, nurse, etc. ) 27 

Pension (s) (specify) . ... 28 

Assistance payments from Bureau of Indian 
Affairs 29 
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no 



1 


ye s 


0 


no 


Assistance payments from other public 












or private sources 




1 


ye s 


0 


no 


Interest or dividends on personal loans 












and investments 




1 


ye s 


0 


no 


Money receipts from tribal or individual 












allottee sources (timber sales, leases, 












permits, royalties, annuity payments. 












dividend payments) 


? 


1 


ye s 


0 


no 


Judgment or settlement funds 




1 


ye s 


0 


no 


Sale of property 


. . . 


1 


ye s 


0 


no 


Veteran's payments . 


... 3 


1 


ye s 


0 


no 


Social Security benefits 


... 3 


1 


ye s 


0 


no 


Unemployment insurance 


... 37 


1 


ye s 


0 


no 


Other (specify) 


38 


1 




n 






... 39 


yes 


u 


no 






What was 


your 


total money income in 1967? 


40 


1 


$ 




0 


5 $ 2, 000 - 2, 999 




2 


$ 




1 


499 6 $ 3, 000 - 4, 999 




3 


$ 


5 


00 - 


999 7 $ 5, 000 - 9, 999 




4 


$ : 


1, 000 - 


1, 999 8 $10, 000 and over 





37. Did you receive any non-money income such as 



38. 



1 yes 0 no 



1 yes 0 no 
1 yes 0 no 

1 yes 0 no 

What is the monetary equivalent of your additional income ? ... 45 

1 $ 0 

2 $ 1 - 499 

3 $ 500 - 999 

4 $1, 000 or over 



Homegrown and consumed agricultural 

products 

Homemade clothing • • 42_ 

Goods exchange for other goods 43_ 

Othe r 44_ 
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39 . 



What were the sources of income received by your family in the 
last twelve months; ~~ 



1 yes 0 no 
1 yes 0 no 
1 yes 0 no 



1 ye s 0 no 
1 yes 0 no 



Gifts from children or relatives or churches 
From sale of handicrafts 

Self-employed income (business, farm trade, 
or professional enterprise) individual or 
a-s a partner 

warnings from the ownership of a farm or 
ranch, a craft shop or retail store, a 
fishing, tourist, or guide - se rvice business 
Earnings from a trade (carpenter, well- 
driller, paying contractor, auto mechanic, 
barber, ranch manager, dressmaker, 
nurse, etc. ) 



46 

47~ 



48 



49 



50 



40. 



1 


ye s 


0 


no 


Pension( s) ( specify) 


51 


1 


ye s 


0 


no 


Assistance payments from Bureau of 
Indian Affairs 


52 


1 


ye s 


0 


no 


Assistance payments from other public or 
private sources 


53 


1 


ye s 


0 


no 


Interest or dividends on personal loans 
and investments 


54 


1 


ye s 


0 


no 


Money receipts from tribal or individual 
allottee sources (timber sales, leases, 
permits, royalties, annuity payments, 
dividend payments) 


55 


1 


ye s 


0 


no 


Judgment of settlement funds 


56 


1 


ye s 


0 


no 


Sale of property 


57 


1 


ye s 


0 


no 


Veteran*s payments 


58 


1 


ye s 


0 


no 


Social Security benefits 


59 


1 


ye s 


0 


no 


Unemployment insurance 


60 


1 


ye s 


0 


no 


Other (specify) 


61 


1 


ye s 


0 


no 


None 


62 


Wlxat was 


the 


total monetary and non-monetary income of 


63 



ERIC 



1 

2 

3 

4 



$ 

$ 



0 


- 499 


5 


$ 3, 


000 


- 4,999 


500 


- 999 


6 


$ 5, 


000 


- 9,999 


000 


- 1, 999 


7 


$10, 


000 


and over 


000 


- 2, 999 








Sex IM OF 



81 

85 V 



Year of birth 
Zone 

Sub sample 
Card 
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Health Planning 



1. How di. you describe your general state of health? 

e xc e lie nt 

good_ 

fair 

poor 

2. Do you have a disability which prevents you from working? 

ye s 

3. Have you ever visited a doctor? 

ye s ^o 

4. Have you ever used the Indian Health Service Clinic? 

yes 

5. Do you prefer to use? 

Indian Health Service 

Off reservation private facilities^ 

6. If you visit the Indian Health Service, how often during a year? 

1 3 5 7 9 

246 8 ^ 1 0 

7,. If female and have had children, were they born 

in a hospital on the reservation 

clinic facility on the reservation 

at home 

8. Were you attended? 

by a physician 

mid -wife 
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9. 



How many visits did you make to a physician prior to birth? 



1 5 9 

2 6 more than 9 

3 7 

4 8 

10. Did you take your children to a physician for a check up after birth? 

ye s no 

11. If yes, was this a routine check up or was it because of illness? 

Explain 

12. Have you had the following immunization and the approximate dates: 

s mallpox^ 

dipthe r ia 

tetanus 

me a s le s 

typhoid^ 

13. What is the source of your water supply? 
well 

local water company 

other 
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Housing Needs 



Questions to be asked of heads of households 



1. Do you own your own home? 

ye s no 

2. Do you rent ? 

ye s no 

3. How old is the home in which you live ? 

1-3 years 11-15 years 

4«6 years 16-20 years 

7-10 years 21-25 years 

4. How many rooms in your house? 

]. 4 7 

2 5 8 

3 6 9 

5 . Is your bathroom? 

1 inside 

2 outside 



6, Do you have ? 

1 hot water 

2 cold wate r 

7, How many people live in your home permanently? 

1 4 7 

2 5 8 ~ ~ 

3 6 9 
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26-30 years 
ove r 30 years 



8 . 



Do others live in your home temporarily? 

ye s no 

9. What is the relation to you of those who live with you temporarily? 

1 Father 

2 Mothe r 

3 Older Son &: Family 

4 Older Daughter & Family 

5 Other relatives (specify) 

6 Friends 

10. How far is your house from where you work? 

Less than one mile 
1-2 miles 

3-5 miles 

6-8 miles 

9-* 10 miles 

More than 10 miles 

11. What type of heat do you use in your house? 

gas stove 

electric stove 

coal stove 
wood 
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Labor Force Information 



ASK ALL RESPONDENTS 



1. What "were you doing most of last year ? 

1. Working 

2. With a job but not a work 

3. Looking for work 

4. Keeping house 

5. Going to school 

6. Unable to work 
Retired 

8, Other (specify) 

2. How much did you work last year? 

1. 0 (go to green) 

2. 1-3 months (go to pink) 

3. 4-6 months (go to pink) 

4. 7-9 months (go to pink) 

5. 10-12 months (go to pink) 
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APPENDIX B 



Notation and Formulas 
Used for Sample Survey Designs 



Si mple Random Sampling 



X = Estimate of mean Indian characteristic, such as individual income 
£/ _ n 

X = X xi Where: xi = ith element selected for the sample 

n 11 - sample size 

V r: The standard error of the sample estimate of a mean character- 
istic, such as income 



== (N-n) S Where: N 



N 



n 



Z _ 



= population size 

varianc^of the population 
N(Xi-X) & is estimated by s , the 
^ N sample variance 

( X is the true population me 



Z ^ ^2 

s = £ (xi-x) xi ith sample element 

n-1 X = sample mean 

p = Estimate of proportion of Indians exhibiting a particular character- 
istic, such as a given income level. 

p = x/n, where x = total Indi^-ns in the sample with the characteristic 

V = The standard error of the estimate of proportion 
P 

vp^ = (N-n) pq Where: p = the estimate of the proportion exhibiting 

N n the characteristic of interest 

q = the estimate of the proportion not ex- 
hibiting the characteristic of interest 



1/ ^ means to sum over all relevant values. 
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X 



Estimate of the population total for a particular characteristic, 
such as total individual income. 



X = Nx or Np 



vt 



The standard error of the sample total. 



Vf-^ = N^( N-n) 

N n 



Sti'atified Simple Random Sampling 



X = Estimate of mean Indian characteristic, such as individual income. 
L 

X = Where: N. = Population elements in hth stratum 

h li n 

N 



xj^ = Sample mean in hth stratum 



_ 



X, i = ith element in hth stratum 
n 



L = Number of strata 

= The standard error of the sample estimate of a mean characteristic, 
such as income. 



2 



V 





Z V (Nh-“h) 

Nh 




2 

Where: 



S 




2 



®h 



^h 

z (x^i-Xh) 



2 



^h 



-1 
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Variance of the elements 
in the hth stratum. 

Nh __ 2 

2- (^h^"^h) ^ is estimated by 

®h^> sample 

variance. 

X}^i = sample elements in 
hth stratum 

xh = sample mean of hth 
stratum 



Cluster Sampling (Single Stage) 



X = Estimate of mean Indian characteristic, such as individual income. 

ith element selected from enumeration 
of the clusters chosen for the sample, 
number of elements included in the 
sample froiV'. m clusters sampled 
from total available. 

V = The standard error of the sample estimate of mean characteristic, 
such as income. 

(M-m) Sc^ 

M 
m 



Sq is estimated with 
m 

2 (X(,-x/ m)^ 
m- 1 

= total value of elements of cth 
cluster sampled. 

X = total value of elements of all clusters 
sampled. 




Where: M 

m 



Sc^ 



'sr 



number of clusters in population 
number of clusters selected for 
sample . 

M 2 

(Xc-X/M ) 

M-1 

total value of elements of cth cluster, 
total value of elements of all clusters. 




X = 



n 

ilL 

n 



Whe re : = 



n = 



Systematic Sampling 

X = Estimate of mean Indian characteristic, such as individual income, 
n 

X =^xi Where: xi = An individual element selected for 

n the sample. 
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V = The standard error of the sample estimate of a mean characteristic, 
such as income. 

= Approximated as (N-n) 

N n 






is estimated by s 



2 



2 

s 



n 

= ^ ( xi-x) 
n-1 



2 
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1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 

26 

27 

28 

29 

30 

31 

32 

33 

34 

35 

36 

37 

38 

39 

40 



APPENDIX C 

1000 Hypothetical Individual 
Indian Incomes Simulated 
From Indian Manpower Resoui'ces 
In The Southwest Study 
Rovv'7 Column 



1 


2 


3 


4 


5 


6 


7 


0 


0 


0 


0 


57 


42 


180 


0 


0 


0 


0 


33 


57 


141 


0 


0 


0 


0 


5 


73 


169 


0 


0 


0 


0 


92 


13 


168 


0 


0 


0 


0 


54 


62 


189 


0 


0 


0 


0 


41 


99 


174 


0 


0 


0 


0 


83 


60 


137 


0 


0 


0 


0 


84 


49 


182 


0 


0 


0 


0 


12 


11 


157 


0 


0 


0 


0 


81 


187 


19.3 


0 


0 


0 


0 


59 


104 


102 


0 


0 


0 


0 


18 


107 


104 


0 


0 


0 


0 


79 


177 


124 


0 


0 


0 


0 


45 


188 


165 


0 


0 


0 


0 


13 


113 


172 


0 


0 


0 


0 


91 


151 


131 


0 


0 


0 


0 


24 


148 


174 


0 


0 


0 


0 


94 


124 


197 


0 


0 


0 


0 


22 


157 


164 


0 


0 


0 


0 


82 


127 


127 


0 


0 


0 


0 


62 


189 


121 


0 


0 


0 


0 


89 


181 


165 


0 


0 


0 


0 


81 


115 


161 


0 


0 


0 


0 


43 


199 


181 


0 


0 


0 


0 


60 


180 


180 


0 


0 


0 


0 


93 


191 


166 


0 


0 


0 


0 


42 


152 


109 


0 


0 


0 


60 


12 


179 


151 


0 


0 


0 


34 


71 


166 


193 


0 


0 


0 


4 


9 


130 


137 


0 


0 


0 


9 


79 


115 


119 


0 


0 


0 


42 


41 


186 


191 


0 


0 


0 


49 


73 


154 


137 


0 


0 


0 


9 


92 


167 


246 


0 


0 


0 


69 


63 


183 


202 


0 


0 


0 


82 


97 


102 


226 


0 


0 


0 


30 


64 


142 


282 


0 


0 


0 


86 


86 


195 


241 


0 


0 


0 


83 


62 


186 


290 


0 


0 


0 


43 


42 


145 


242 



91 

95 



1 

2 

3 

4 

5 

6 

7 

8 

9 

0 

11 

12 

13 

_5 

16 

17 

18 

Z9 

20 

21 

22 

23 

24 

25 

26 

27 

28 

29 

30 

31 

32 

33 

34 

35 

36 

37 

38 

39 

40 



14 

733 

745 

743 

713 

713 

702 

793 

781 

766 

725 

721 

756 

710 

769 

784 

771 

770 

78 '^ 

774 

746 

747 

763 

738 

786 

719 

795 

729 

795 

775 

846 

856 

807 

893 

888 

841 

825 

894 

842 

879 

862 
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8 


9 


10 


11 


12 


13 


228 


272 


327 


470 


434 


593 


204 


219 


367 


484 


463 


544 


206 


267 


379 


450 


416 


520 


209 


259 


317 


483 


499 


513 


239 


287 


377 


435 


474 


538 


239 


240 


335 


402 


428 


562 


250 


267 


318 


481 


41C 


582 


203 


221 


302 


443 


4G5 


585 


231 


244 


370 


421 


428 


530 


284 


206 


357 


408 


401 


631 


288 


218 


327 


482 


411 


6-^2 


257 


205 


320 


455 


477 


659 


259 


222 


348 


413 


488 


611 


218 


296 


323 


458 


406 


642 


284 


287 


354 


462 


496 


616 


278 


318 


328 


458 


422 


699 


234 


381 


346 


426 


465 


677 


296 


386 


305 


442 


401 


615 


201 


379 


391 


481 


465 


570 


272 


303 


367 


479 


5-3 2 


650 


247 


327 


344 


433 


511 


625 


268 


335 


326 


439 


566 


659 


233 


348 


364 


445 


549 


690 


288 


381 


378 


426 


540 


656 


207 


340 


362 


497 


579 


622 


220 


338 


317 


424 


531 


694 


232 


320 


304 


496 


524 


665 


238 


362 


381 


409 


582 


676 


284 


379 


303 


438 


510 


694 


252 


390 


326 


480 


537 


634 


204 


368 


358 


414 


560 


643 


203 


333 


369 


464 


557 


620 


262 


372 


352 


422 


544 


679 


298 


367 


347 


482 


552 


660 


258 


368 


305 


405 


506 


657 


287 


379 


382 


423 


594 


657 


266 


314 


343 


495 


543 


605 


279 


321 


484 


419 


528 


671 


247 


375 


476 


469 


536 


639 


273 


392 


465 


425 


562 


769 



1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

1 c 

16 

17 

IS 

19 

20 

21 

22 

23 

24 

25 

26 

27 

28 

29 

30 

31 

32 

33 

34 

35 

36 

37 

38 

39 

40 



21 

2309 

2:-04 

2392 

2312 

2308 

2377 

2426 

2450 

2443 

1415 

2403 

2480 

2543 

2583 

2524 

2588 

2515 

2591 

2662 

2684 

2645 

2655 

2663 

2690 

2723 

2756 

2702 

2747 

2729 

2750 

2832 

2893 

2817 

2868 

2888 

2817 

2965 

2927 

2935 

2969 
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15 

804 

808 

889 

852 

871 

860 

878 

814 

875 

891 

865 

858 

862 

816 

836 

888 

849 

861 

835 

934 

994 

976 

965 

994 

922 

990 

933 

960 

979 

920 

943 

943 

971 

939 

905 

971 
957 

972 
930 
982 



16 

988 

916 

953 

970 

973 

970 

1085 

1062 

1030 

1070 

1030 

1005 

1097 

1032 

1037 

1037 

1007 

1091 
1041 

1092 
1009 
1022 
1076 
1048 
1140 
1121 
1159 
1178 
1185 
1144 
1180 
1173 
1135 
1187 
1185 
1194 

1178 

1179 
1149 
1109 



17 


IS 


1157 


1401 


1123 


1490 


1265 


1425 


1262 


1442 


1272 


1427 


1266 


1467 


1289 


1443 


1257 


1404 


1226 


1438 


1225 


1477 


1272 


1478 


1231 


1429 


1268 




1274 


1495 


1267 


1417 


1244 


1446 


1289 


1515 


1270 


1591 


1241 


1562 


1289 


1584 


1384 


1545 


1545 


1555 


1328 


1563 


1332 


1590 


1387 


1523 


1389 


1556 


1355 


1502 


1394 


1547 


1398 


1529 


1378 


1550 


1333 


1532 


1398 


1593 


1377 


1517 


1364 


1568 


1398 


1641 


1346 


1607 


1362 


1678 


1333 


1641 


1403 


1651 


1468 


1606 



19 


20 


1681 


1899 


1528 


1887 


1635 


1862 


1625 


182b 


1 153 


1862 


■ 618 


1859 


_643 


1873 


1687 


1888 


1622 


1946 


1688 


1923 


1670 


1928 


: 652 


1983 


^768 


1979 


1747 


1905 


1778 


1975 


1760 


1994 


1706 


1940 


1702 


1928 


1715 


1925 


1715 


1954 


1761 


1948 


1798 


1932 


1763 


2014 


1780 


2044 


1746 


2006 


1753 


2001 


1735 


2074 


1745 


2087 


1732 


2113 


1777 


2125 


1852 


2119 


1812 


2165 


1813 


2177 


1888 


2174 


1895 


2207 


1820 


2213 


1887 


2231 


1893 


2242 


1852 


2276 


1864 


2263 



93 

97 



1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 

26 

27 

28 

29 

30 

31 

32 

33 

34 

35 

36 

37 

38 

39 

40 
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22 


2 " 


24 


25 


2965 


37 7r 


4553 


6991 


3096 


382^ 


4607 


7216 


3095 


385: 


4622 


7137 


3086 


3895 


4689 


7004 


3083 


3800 


4636 


7293 


3022 


3877 


4630 


7389 


3165 


3961 


4780 


7429 


3147 


391_ 


4701 


7525 


3180 


3945 


4781 


7592 


3163 


3929 


4744 


7524 


3112 


3931 


4731 


7673 


3284 


4031 


4804 


7745 


3257 


4052 


4817 


7833 


3222 


4012 


4869 


7928 


3257 


4013 


4807 


8086 


3212 


4088 


4857 


8141 


3399 


4152 


4940 


8197 


3373 


4193 


4995 


8240 


3388 


4187 


4918 


8372 


3354 


4129 


5023 


8497 


3301 


4195 


5167 


8591 


3483 


4299 


5184 


8688 


3486 


4287 


5243 


8760 


3406 


4262 


5329 


8838 


3404 


4226 


5453 


8979 


3442 


4262 


5502 


9051 


3583 


4359 


5605 


9143 


3525 


4378 


5745 


9297 


3540 


4388 


5847 


9441 


3576 


4321 


5960 


9322 


3522 


4323 


6082 


9583 


3665 


4461 


6144 


9655 


3604 


4401 


6293 


9621 


3682 


4442 


6221 


9711 


3611 


4445 


6395 


9860 


3670 


4428 


6409 


9845 


3770 


4579 


6562 


9939 


3740 


4519 


6646 


10297 


3780 


4541 


6745 


11374 


3774 


4515 


6879 


13600 



appendix d 



Nun' -er St. ndard Errors (k) 

Assoc: .. t i,th Various Confidence Levels 



k 



Confidence 

Level 



1 . 0 

1. 5 

2 . 0 

2. 5 

3. 0 

3. 5 

4. 0 



. 682 
.8 66 
. 955 
. 988 
. 997 
. 999 

About 1.000 




95 

99 



APPENDIX E 
Simulated ^Sample Results 
Simple Random Sampling 



Preliminary Sample - 30 


Eleme nts 


60 


1590 


458 


267 


1780 


5453 


113 


0 


458 


0 


2006 


659 


0 


228 


413 


0 


10297 


127 


8086 


3961 


0 


1490 


82 


814 


593 


1378 


2443 


620 


1975 


63 


- Income (x) 


$1514 




i d Deviation(s) 


$2452 




Regular Sample 


- Additional 


346 Eleme 



Total Incomes ^rawn 
for Extended Sample 


$505, 708 


New Mean Individual Income With 
376 Indians 


$1, 465 


New Standarrl Deviation With 
376 Indians 


$2, 287 


Standard Error 


$118 




‘?y9 7 
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APPENDIX F 



Sample Size Required 
With Simple Random Sampling 
@ . 95 Confidence Interval 



N 


A 


n 


1000 


$50 


905 




100 


706 




200 


376 




300 


211 


2000 


50 


1656 




100 


1092 




200 


462 




300 


236 


5000 


50 


3290 




100 


1624 




200 


537 




300 


254 
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APPENDIX G 



Simulated Sample Results 
Stratified Simple Random Sampling 



Strata 


1 


2 


3 


4 




$ 0 


$1289 


$3373 


$7833 




409 


1384 


5329 


9860 




252 


1135 


3824 


7429 




0 


2263 


3284 


8141 




408 


2443 


4461 


13600 




13 


1231 


4388 


8978 




2 04 


2309 


4195 


9845 




369 


1404 


4321 


9322 




435 


1946 


463 6 


7216 




267 


1179 


4359 


11374 




9 


1932 


5745 


9051 




13 


1706 


4321 


7389 




236 


2113 


4701 


7525 




188 


1009 


6745 


9143 




409 


2888 


3112 


7673 




169 


1979 


3086 


9621 




0 


1438 


4622 


7745 




368 


1333 


3670 


9297 




22 


1022 


4401 


8497 




470 


1628 


5329 


8591 




142 


1345 


387 7 


8197 




358 


1545 


5243 


9322 




43 


1109 


361 1 


9711 




0 


1798 


3800 


7592 




30 


2965 


5083 


8086 




282 


1332 


6646 


8838 




552 


2817 


4804 


9583 




0 


1467 


3222 


7745 




179 


2684 


6409 


9939 




93 


2242 


3180 


8688 


Stratum Mean 


$197. 40 


$1764. 50 


$4457. 23 


$8861. 03 


Stratum Standard 










Deviation 


$174. 95 


$ 587. 53 


$1020. 76 


$1333. 14 


Overall Mean 








$1414. 73 



Standard Error $ 35. 96 




/(Tbj 
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APPENDIX H 



Simulated Sample Results 
Single Stage Cluster Sampling 



Clusters in Population 



1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


14 


15 


16 


17 


18 


19 


20 


21 


22 


23 


24 


25 


26 


27 


28 


29 


30 


31 


32 


33 


34 


35 


36 


37 


38 


39 


40 


41 


42 


43 


44 


45 


46 


47 


48 


49 


50 



TOTAL INCOME OF 20 INDIANS LOCATED 
IN CLUSTERS SELECTED RANDOMLY 



CLUSTER NUMBER 
DRAWN FOR SAMPLE 



CLUSTER 

INCOME TOTAL 



10 


$31, 778 


30 


31, 835 


21 


31,213 


19 


29, 840 


23 


31, 713 


5 


29, 914 


8 


25, 636 


7 


34, 057 


27 


27, 299 


29 


37, 122 


41 


28, 055 


6 


25, 317 


15 


26, 135 


34 


30, 698 


32 


29,996 


39 


26, 182 


20 


26, 403 


48 


26, 357 


43 


26, 045 



Mean Income Per Cluster 
Mean Income Per Indian 
Standard Error 



$29, 274 
$ 1, 463. 70 
$ 29.43 




APPENDIX I 



Simulated Sample Results 
Systematic Sampling 



Total income of 333 elements drawn 
with the Znd unit and selecting every 

3rd one thereafter $48Z, 09C 

Mean individual Indian income $1, 448 



Sample standard deviation 

(Approximate) $Z, Z87 



Standard error 



$1Z4. 97 
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